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IMPROVED MEDIA DELIVERY PLATFORM 
CR OSS-REFERENCES TO RELATED APPLICATI ONS 

This is a continuation application claiming priority from U.S. Provisional Application Serial No. 60/301,681 
filed on June 27, 2001, U.S. Provisional Application Serial No. 60/303,115 filed on July 3, 2001, U.S. Provisional 
Application Serial No. 60/3 12,450 filed on August 14, 2001, and U.S. Provisional Application Serial No. 60/343,159 
filed October 26/ 2001, all of which applications are incorporated herein by this reference thereto. 

■ TECHNICAL FIELD 



This invention relates to a method of delivery and play back of sound and image files for wireless and non- 
wireless electronic devices. 



BACKGROUND ART 



The general concept for delivery of sound recordings or clips and visual recordings or clips by way of the 
Internet is known and described in various U.S. patent applications. (See Bernard et al., U.S. Patent No. 5,918,213; 
Kaplan, U. S. Patent No. 5,963,916; Barbara, U.S. Patent No. 5,926,789; and Doerr et al., U. S. Patent No. 
5,949,4 1 .) Such methods are typically used to sell products to consumers. For example, a web page from Amazon.com 
allows a user to listen to samples of music for before purchasing compact discs (CD's) by mail. 

Also, cell phones may be programmed to ring with a tune of a song or musical composition, and have become 
increasingly popular. However, cellular phones currently on the market can only be either programmed to only play 
music (such as conventional MP3 type phones) or to deliver "ring tones" with an electronic chime or ring tone rather 
than an actual recorded song, human voice, or musical composition. Additionally, these ring tones must be factory 
installed in the telephone or the delivery methods just directly interface with the Internet and require the consumer to be 
on line to access and download a particular mechanical ring tone. 

At the same time, various methods have been developed and are being used to enable a phone user to make more 
effective use of the variety of telephone service now available. For example, "caller ID" function is one such feature 
which allows the recipient of an incoming call to identify the caller based on textual information provided on a 
telephone display panel. A patent to Borland, U. S.PatentNo. 6,178,230 discloses an improved telephone system and 
method that determines the identity of the person being called for a telephone having more than one user and can 
identify the person being called by sounding a distinctive ring associated with the person being called. A mechanical 
ring tone is played depending upon the caller ID signal received to orally alert the telephone user as to who is calling 
without reading the telephone's display panel. 

DISCLOSURE OF INVENTION 



The present invention provides an improved method for delivery and play back of sound and image files which 
include songs, musical compositions, and other sound recordings cartoons, movies, television shows, or any other type 
of performance, which may be copyright registered, as well as non copyright registered personal recordings (e.g., 
personal sound recordings, family photos, home movies, etc.). This new method includes the use of sound and/or image 
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clips, which can be snippets or full files, as alerts for a variety of electronic devices or for playing on a handheld device. 
A collection or library of uniquely selected and/or edited clips may also be provided to the consumer in a manner far 
more conveniently on conventional telephone equipment than previously available. 

The method provides the consumer with a unique way of accessing and browsing through selectable files which 
may be Internet based or independent of the Internet. Additionally, the unique delivery method provides a seller or 
service provider with a convenient and more efficient way of promoting and selling entire sound and image files which 
include downloadable music, movies, films, shows, and items such as records, cassette tapes, CDs, videos, and DVDs.. 

Algorithms are provided for the delivery, storage and playback of the sound files, including a delivery method 
algorithm, a parametric optimization and compression algorithm, and an error correction algorithm. 

According to one embodiment, sound files are accessed by a cellular or landline telephone for allowing the 
consumer to browse, download, hear and/ or purchase sound files or use sound files including sound clips as ringer 
sounds. In contrast to the conventional ring tones or musical chimes used to ring cellular phones currently on the 
market, the current invention provides a method for ringing cellular phones (both analogue and digital) and landline 
telephones with real sound recordings including real music, which may be songs lifted from copyright registered CD 
tracks, and may comprise human voice, various instrument sounds, and other sound effects of a high quality. Instead of 
simply tones being played the higher fidelity musical composition can be played by the telephone or other handset with 
a degree of fidelity previously unavailable using conventional methods. The high degree of fidelity is achieved using 
data compression, error correction and parametric optimization algorithms adaptable to conventional telephones and 
other handheld devices. 

A software based system for encoding the hardware of existing cellular phones at the time of manufacturing with 
delivery, storage, and playback capabilities in accordance with the present invention is provided, such that additional 
hardware is not required. (Only a suitable speaker need be required with most telephones already possessing the 
necessary quality of speaker.) The ability to provide this technology without the need for extra hardware is very 
significant, particularly to the cellular phone industry, as it is especially desirable to make cellular phones as lightweight 
and as small as possible and at the lowest cost. 

An accessory attachment to standard telephones can however be incorporated to implement the delivery, storage, 
and playback capabilities of the present invention to existing landline and cellular telephones which have not been 
encoded at the time of their manufacture, if necessary. Such accessory attachments are compatible with existing 
telephones, and may be sold separately. Also, a micro chip may be embedded in landline telephones for providing the 
telephone with browsing, delivery, storage, and playback capabilities of the present invention. 

The accessory attachment or telephone encoded with software and/or including hardware for providing delivery, 
storage, and playback capabilities as described herein, may be manufactured with embedded sound files including sound 
clips, such that a user can immediately play back the files, including use the files as ringer sounds, without having to 
first download any files. 

Additionally, upon hearing a sound clip on the telephone, a user may choose to download the entire unedited 
sound file for a fee or purchase an item associated with the sound clip (e.g., record, cassette tape, CD, video, or DVD) 
by pressing a designated button on the accessory attachment or keys on the telephone keypad. As such, sound clips 
which have been downloaded to, or preprogrammed on a cell phone, may encourage and stimulate the sale of 
downloadable files and/ or items associated with the clips by allowing the user to make an impulsive purchase 
immediately upon hearing the clips. 

-2- 
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The accessing of sound and/or image files by other electronic devices, such as home phones, computers, pagers, 
doorbells, alarms, palm pilots, watches, clocks, PDAs etc., for either allowing the consumer to browse, download, hear, 
view, and/or purchase sound recordings, image files, or associated items, or to use sound and/or image clips as alerts is 
also part of the invention and not limited to solely telephones. New electronic devices, whose independent purpose is to 
allow the user to browse, receive, store and play sound and image Files, including clips, according to the present 
invention are also described. 

A security feature may be included on such electronic devices adapted for allowing a consumer to access and use 
sound and image flies according to the present invention. This feature is designed to prevent intellectual property abuse 
by consumers' unauthorized dissemination and reproduction of copyright protected material. The downloaded chips or 
recordings are coded and cannot be downloadable or transferred to units other than the consumers' preselected layer or 
telephone. 

Also, a tracking feature for keeping a record of every song downloaded and/or each time a song is played can be 
incorporated for providing performing rights organizations or songwriters' organizations with an accurate method for 
determining royalty payments to writers and performers of music. 

Additionally, a website suitable for viewing and selecting downloading sound and/or image clips or entire files 
may be used for giving the consumer and music or image seller a unique way of transacting the sale of such files or 
other associated items such as records, cassette tapes, CD's, videos, or DVD's. The website may allow the clips to be 
stored on a user's computer, providing the user the ability to readily access the clips for downloading the clips to an 
electronic devise, using the clips as computer alerts, or playing the clips on the computer. The user may also purchase 
files or items associated with the clips through the computer and/or website. 

The system of the present invention may also allow the consumer to browse through hundreds or thousands of 
sound and/or image clips and/or files for the purpose of downloading to electronic devices with an option to purchase 
an associated record, cassette tape, CD, video or DVD, or download the full undipped sound and/or image file. 

Furthermore, the delivery of files including clips is not limited to web based applications. Unlike conventional 
methods which require computer plug-in devices for delivering and trarisferring digital music, the current invention may 
use a delivery method which allows the user to browse, download, and listen to or watch sound or image files without 
the need for hand wired plug-in devices or a computer connection to the Internet. 

BRIEF DESCRIPTION OF DRAWINGS 

FIG. 1 is a general schematic diagram illustrating the basic components of a wireless transmission system for a 
landline or cellular telephone. 

FIG. 2 is a schematic diagram of a wireless transmission system for a cellular phone, 

FIG. 3 is a schematic diagram of a board system implemented in an accessory unit of the system of FIG. 2. 

FIG. 4 is a schematic diagram of a server software system for the system of FIG. 2. 

FIG. 5 is a flow chart illustrating a data transmission method. 

FIGS. 6 A-D illustrate the electrical schematics of a mobile telephone accessory unit in accordance with the 
present invention. 

FIG. 7 illustrates an image of a printed circuit board for the accessory unit of FIGS. 6 A-D. 
FIG. 8 is an exploded side view illustrating the assembly of a cellular phone accessory unit and cellular phone 
attachment to the accessory unit 

-3- 
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FIG. 9 is a perspective view of the cellular phone accessory unit and cellular phone of FIG. 8, showing the 
cellular phone attached to the back of the phone connector and mounting of the accessory unit. 

FIG. 10 is a perspective view of the accessory unit of FIG. 9, detached from the telephone. 

FIG. 1 1 is a picture of the accessory unit of FIG. 9, disassembled from the mounting. 
5 FIG. 12 is a schematic diagram of a landline transmission system for a home telephone. 

FIG. 13 is a schematic diagram of a board system implemented in an accessory unit of a home telephone utilizing 
the system of FIG. 12. 

FIG. 14 is a schematic diagram of server software for the system of FIG. 12. 

FIG. 15 is a flow chart for an audio data parametric optimization and compression algorithm. 
io FIG. 16 is a schematic diagram of a protocol for a data transmission method with error correction delivery for a 

digital cellular telephone, illustrating individual packet acknowledgement for a full-duplex channel case. 

FIG. 17 is a schematic diagram of a protocol for a data transmission method with error correction delivery for a 
digital cellular telephone, illustrating single acknowledgement for all packets for a half-duplex channel case. 

FIG. 18 is a schematic diagram for a media file monitoring system. 

15 

M QDE(S^ FOR CARRYING OUT THE INVENTIO N 

The detailed description set forth below in connection with the appended drawings is intended as a description of 
presently-preferred embodiments of the invention and is not intended to represent the only forms in which the present 
invention may be constructed and/or utilized. The description sets forth the functions and the sequence of steps for 

20 constructing and operating the invention in connection with the illustrated embodiments. However, it is to be 
understood that the same or equivalent functions and sequences may be accomplished by different embodiments that are 
also intended to be encompassed within the spirit and scope of the invention. 

The present invention uses a unique method for delivery, storage, and play back of sound and image files which 
include songs, musical compositions, or other sound recordings, cartoons, movies, television shows, or any other type of 

25 performance, as well as personal clips (e.g., personal sound recordings, family photos, home movies, etc.). This method 
includes the use of sound and or image clips as alerts for a variety of electronic equipment, and provides the consumer 
with a unique way of accessing these files which may be Internet based or independent of the Internet. 

The present invention may include a number of modules for an overall system of delivery of music and 
audio/visual files. These modules include a server of the files accessible by way of a specialized website for viewing, 

30 selecting, sampling and downloading selected files or portions thereof or directly accessible without going through a 
website. A telephone, be it conventional, cell phone or other hand held device with access to a communication network 
can access the server either directly or through the website. Special algorithms allow the transfer of the files to the 
handset providing the high gravity recording in a file formal which allows for tracking and security against unauthorized 
reproduction. The individual elements of the invention are unique as well as the overall system of delivery tracking and 

35 security. Described below are more detail aspects of the invention and its use. 
Use of Sound and/or Image Clips as Alerts for Electronic Devices 

According to one embodiment the system allows for sound and/or image clips which are snippets of a musical 
and/or visual performance piece to be used for sound and/or image alerts in electronic devices. A library or collection 
of uniquely edited clips may be provided to the consumer for browsing and selecting files to be stored on the electronic 

40 device. The consumer may also use home made personal clips (e.g. personal sound recordings, family photos, home 
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movies, etc.). 

The sound and/or image clips may be lifted from CD's, movies, TV shows, and the like, and are actual 
recordings, which may include human voice, instrument sound, and other sound effects, rather than mere electronic 
chimes or tones as those produced by conventional cellular .phones. Electronic devices which may utilize sound and/or 
image clips as alerts include, but are not limited to cellular phones, land line phones, computers, clocks, watches, 
pagers, door bells, car alarms, palm pilots, and personal calendars. It should be understood that although using clips for 
alerts is preferable, full unedited files may also be used. 

According to one embodiment, real music sound clips are used to "ring" a cellular or home phone. A clip or 
series of clips, which the user can select, are played instead of the conventional electronic chime or ringione. Such a 
system may be implemented on conventional cellular phones, which may be analogue or digital, by downloading 
firmwear comprising algorithms for delivery, storage, and playback of the soundfiles, to the RAM element of the 
phone. Such algorithms include a delivery method algorithm, a parametric optimization and compression algorithm, and 
an error correction algorithm. Alternatively, an accessory unit that attaches to the cellular phone for implementing the 
system of the present invention may be provided. 

According to another embodiment, sound and/or image clips are used for computer alerts such as e-mail 
notification sounds. Clips may also be used to ring a doorbell. Sound clips may further be used by a clock or watch to 
sound at the beginning of each hour, similar to a grandfather clock, wherein a different sound clip may be played at each 
hour. 

The present invention allows the user to store hundreds of different alert clips on a device. Unlike conventional 
electronic equipment which hold a limited number of selectable alerts, such as a conventional car alarm or music player 
alarm clock, the present invention allows the user to choose from an unlimited number of clip files including allowing 
the user to create his own alert clips or to choose from a library of uniquely selected and/or edited files, including 
samples taken from CD's, movies, television shows and the like. 

A cellular phone, or similar device (having a processor, RAM, and flash elements) may be integrated with 
software at the time of manufacturing for implementing the system of the present invention. Alternatively, a chip may 
be embedded into the device or an accessory unit, including a speaker, which attached to the device for implementing 
the system of the present invention may be provided. The accessory unit may have an adapter connection to the device. 
Such accessory unit may be sold with several adapter outlets to enable it to fit onto a variety of different electronic 
devices. 

Sound and/or image clips may be pre-stored on the electronic device or accessory unit at the time of 
manufacturing, such that the consumer may be able to used the clips for alerts, without first having to select and 
download clips. 

A user of an electronic device, according to the present invention, may download and store anumber of clips off 
of a website via a plug-in connection of the device to the computer, or via a wireless network system such' as the 
Apple® Airport. Additionally, a non-Internet based holding system, which may be especially adapted for delivering 
clips to the electronic device or accessory unit may also be provided. Such holding system may be accessed via a phone 
dial in connection wherein a user may interact with the holding system by using the phone keypads or voice commands. 
Other controls for interacting with the holding system, such as control buttons, voice commands or text keypads, may 
be provided on the accessory unit or the electronic device itself which may be especially adapted for interacting with 
such holding system. The accessory unit or electronic device itself may also be sold preprogrammed with embedded 
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clips for demonstration use. 

Additionally, such electronic devices may be capable of receiving or sending clips directly from one device to 
another device. To prevent transferring of entire files from one device to another, a security feature may be included on 
the devices and work in conjunction with the file. 

One method of preventing the transferring of files is to encode each electronic device or accessory unit with 
unique scrambling/unscrambling wave capabilities. As such, when a user transfers an entire file to his device, say a 
cellular phone, for which he pays a fee, a scrambling wave, which may be a function of his unique telephone number, 
may be encrypted in the file. Upon playing the File, the user's cellular phone sends the corresponding unscrambling 
opposite wave. Other devices purchased by the same user may also include the unique scrambling/unscrambling wave 
encryption capabilities associated with the user's telephone number. As such, the files may only be played with clarity 
on the device or devices owned by the user, even if such files are transferred to other devices. 

The security lock mechanism allows the original music or its representative to control distribution of music, and 
also provides an opportunity for music distributors to keep track of who plays their music. As such, a method of 
accounting for royalty payments to artists and performers and other parties registered with performing rights 
organizations such as ASCAP and BMI may include providing a tracking feature on electronic devices used by 
businesses such as bars, restaurants, and clubs to play music. In addition to allowing a record to be kept as to which 
music files have been downloaded and stored on the electronic device, the tracking feature may also record information 
on how many times and when each song has been played. This allows performing rights or music writers organizations 
the ability to keep an accurate record on which to base royalty payment distributions. 

An electronic device having stored sound or image clips may include various features which allow the user to 
preprogram the clips to play in a set sequence or a random order. (For example, certain clips which may be from the 
same or different songs may play in a congruous or back-to-back order with a fixed silence time between the clips.) 
Additionally, the device may have features allowing the user to classify and arrange the clips based on categories such 
as the type of clip (i.e., movie, song, etc.), artist name, time period, etc. 

Thus, a user of an electronic device utilizing the clips according to the present invention will be able to arrange 
the clips either through a website from which the clips can be downloaded onto the device, or through the device itself. 

Additionally, after listening to or viewing a clip, the user can choose to download the entire file from where the 
clip originated (i.e. the entire movie, song, etc.). The server providing the clips and the files may provide the clips for 
free or for a small fee as inducement for the consumer to ultimately download the entire file for which a greater fee may 
be charged. 

An electronic device according to the present invention may also have the ability to receive clips which are 
directly transmitted onto the electronic device via audio or visual broadcasts. The user of an electronic device may 
program the device to sound a specified broadcast as an alert sound. For example, a sound segment from a live radio 
show (i.e., a sports show or a commercial) may be used to ring a cellular phone by either the caller or the callee. 

Advertisements may also be transmitted through the electronic devices according to the present invention. A 
message such as "pick up the phone and don't forget to drink Coca Cola™" may be used to alert of an incoming call. 
Such transmitted advertising messages need not necessarily function as alerts. 

Additionally, this invention contemplates the use of image and sound clips which can be combined such that the 
user can create a unique clip of both sound and image for use in electronic devices having display screens. For 
example, a phone having an appropriate display screen can be preprogrammed to display a visual clip of a caller 
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accompanied by sound, or a computer alert may display an image clip with sound. An image clip may comprise a single 
image frame or a picture clip as well as an animation. 

Website to be used as a Support Tool for Downloading Clips to Electronic Devices and Method for Selling 

According to a preferred embodiment, a website for downloading sound and/or image clips holds a library of 
clips, each clip having a specific identifying code or icon which may include, for example, the title of a musical 
composition or movie from where the clip originated, the name of the artist, a code number, or other type of 
identification depending on the type of clip. For example, a song clip may be listed as barrywhite@lovestuff.wav, or 
may display the picture of the song artist or CD cover of the CD on which the song appears, along with the name of the 
song. The list may be organized according to the artist's name, by music classification (i.e., pop, jazz, R&B, hip hop, 
etc.), by length of the sound clip, by the type of sound clip (i.e., song, piano music, guitar music, loud, quiet, etc.), or 
any combination of these categories or other conventional categories depending on the type of clip (image or sound). 
The website may also include categories of longer clips which may be more suitable for phone rings, and shorter clips 
which may be more suitable for computer alerts. The website may further contain a suggested list of weekly or daily 
favorite clip picks, which may be provided for each category or subcategory. Additionally, items or subcategories in a 
given category may be organized alphabetically, by year of copyright, or any other conventional order. 

Tables 1 and 2 are examples of possible arrangements for sound clips using music classification and artist name. 
Note that the listings of Table 1 such as barrywhite@lovestuff.wav are not websites, but use symbols associated with 
web use, such symbols being one of the many arbitrary ways of listing the clips. The symbol-driven website-like 
listings may end with other non-domain suffixes such as ".pop", ".song 5 *, etc. Additionally, this invention contemplates 
the use of website hyperlinks associated with each listing as shown below in Tables 1 and 2. 



R&B 


Jazz 


Rock 


Barry White 


Herb Alpert 


Chicago 


B arrywhite@lovestuff 


herbalpert@sunspots.pop 


Chicago@feelings.wav 


Barrywhite@deepvox 


herbalpert@datingame.pop 


Chicago@time.wav 


Table 1 


R&B 


Jazz 


Rock 


Barry White 


Herb Alpert 


Chicago 


BW-01 A-Sexy 


HA-01 A-Date 


C-01 A-Begin 


BW-02 A-Love 


HA-02 A-Bull 


C-02A-Search 


BW-03 A-Peace 


HA-03A-101 


C-03 A-Color 



Table 2 



The clips may also have an identifying number associated with each clip. Such identifying numbers may be used 
in downloading the clips to an electronic device using a telephone (described below) or other device having a number 
keypad. The website may further include a virtual personal locker or storage area for storing a selection of clips 
personal to a user which can be accessed on the website by a unique user identification name or code. As such, a user 
can store clips for later purchasing, downloading to the user's cellular phone, playing, etc. The website may also allow 
the user to upload personal clips such as family photos, voice recordings, home movies, and the like, to the storage 
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locker for later downloading to the user's cellular phone or other electronic equipment. The storage locker may include 
an organizer for storing the clips in alphabetical order, by various categories, or any other order. 

The website may allow for direct downloads of the clips from the website to the computer itself or to other 
electronic devices. 

To illustrate how downloading through the website may be carried out, a user operating the computer may drag 
his/her mouse over the various listed sound or image clips and click on one or more selected clips. Thereafter, a box 
can appear prompting the user to select the appropriate electronic device onto which the clip or clips are to be 
downloaded (e.g., the box may say "CELLULAR PHONE DOWNLOAD OR COMPUTER ALERT DOWNLOAD?" 
Assuming that the "CELLULAR PHONE DOWNLOAD" button is selected, a prompt for typing in the appropriate 
cellular phone number will follow. Thereafter, the selected clip or clips may be uploaded to the user's personal locker 
and made available for downloading to the user's handset. 

Other features may also be included, such as an option allowing the user to arrange multiple downloads in a 
specific order, create a folder for grouping multiple downloads, or a feature incorporated into the phone which causes it 
to ring a selected clip immediately after it has been downloaded. Additionally, clips, which have been previously 
downloaded to the phone may be deleted, rearranged, or reclassified with or without using the website. (There are other 
methods for storing clips on an electronic device such as a cellular phone, some of which are described below, including 
direct downloading access for phones without the need for web phone access capabilities). 

Alternatively, by clicking "COMPUTER ALERT DOWNLOAD," the selected clip will be downloaded to the 
user's computer, allowing the user to select many different sound and image clips for computer alerts, such as e-mail 
notification, computer alarm clock, and computer calendar notification. 

Additionally, multiple clips may be associated with one type of alert, such that a different clip is played for each 
alert event. The user will be able to rotate alert clips and preprogram or randomize their order similar to a CD stereo 
carousel. 

The user will also be able to hear or view a selected clip which will play on the website upon the user's 
command. Browsing capabilities wherein the user can drag his mouse over the sound clips library of selections and hear 
the clips without having to click or open a file may be included in the website. According to a preferred embodiment, a 
user simply drags his mouse over various clip samples, which light up or flash and play one at a time. Any time the user 
places the cursor over a category of music, the first tune in that category plays, and the icon representing that category 
of music switches to display the name of the artist and title of the song or composition being played. Once the user 
clicks upon that icon, he can select the next song and hear the song while at the same time seeing the name of the artist 
and song title. The user can cycle through all the songs within that category using this approach very quickly to not 
only browse but to also hear the music. If the user does not wish to switch over to another category of music, he simply 
moves the cursor to another icon and repeats this procedure. To select a particular song the user double clicks on the 
song, which is then included in a collection of selected songs to be downloaded later. 

The website may be used as a shopping forum where consumers can hear or view the clips and click to buy items 
associated with the clips such as music records, cassette tapes and CD's, DVD's, and movie videos, or download the 
entire sound or image file to their computer for a fee. By allowing the user to sample and download clips for use as 
alerts in electronic devices, the website will provide an attractive forum for selling items associated with the sound and 
image clips, and for allowing the user to download the entire file associated with the clip, for which a fee may be 
charged. 
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Additionally, an identifying mini icon such as the song title or recording artist CD icon associated with a clip or 

with a group of clips may appear on the computer screen at a fixed location and/or at the screen display where the clip 
plays a computer alert The icon may include a "buy" button which will allow the user to purchase an item associated 
with the clip, or download the entire file from which the clip originated by clicking on the button. Such "buy" button 
s may be a hyperlink to a website for transacting the purchase. Where a CD icon is not used, the user may click on the 
song title to purchase an item associated with the clip. If the clip comes form a song that exists on more than one CD, 
the customer will see more than one CD cover to choose which CD to buy.. 

The utility of clips as alerts for electronic equipment will provide consumers with incentive to browse the 
website and sample the clips. After hearing or viewing the clip, consumers may be induced to purchase items 
> associated with the clips, which they will be able to do instantaneously through the website by the click of their mouse. 

The website may further be used as a contest forum. The website may be set up to play mystery clips or short 
segments of sound recordings which contestants will have to identify in order to win a prize (i.e., by being the first to e- 
mail or call with the correct answer). Thus, a radio show may set up a game where a short segment of a sound clip is 
played on the radio or user's phone for contestants to guess and is also available for the listeners who want to hear it 
again on the website. 

The website may include forwarding capabilities, such that a sound or image clip can be forwarded as a greeting 
to a friend. (The security feature may be used only to prevent transferring of entire files). Consequently, the website 
will attract customers for the purpose of downloading clips to electronic devices and ultimately purchasing items 
associated with the clips. Additionally, the user may subscribe to a service such as an existing cell phone service 
provider for downloading files through their telephone, without having to be online. 

The user may also create a clip (e.g. by recording a song or personal clip) and store the clip onto a sound storage 
element in the electronic device. Additionally, the electronic device can be preprogrammed with clips selected by the 
manufacturer retailer of the device. 

Accessing of Sound and /or Image Files Without Access to Internet 

Sound and/or image files which include clips may be downloaded without use of the Internet by allowing a user 
to access a library of clips via their cellular phone or home telephone or providing other electronic devices with features 
which allow automatic access to the library. (Although Internet free accessing will be described with respect to a 
telephone, it is to be understood that the method describe may be compatible with any electronic device preferably 
having accessing capabilities similar to a telephone). 

The library may be a non-web holding unit that has files with associated codes which match the codes associated 
with the files on the website, wherein the website serves as a usable guide for identifying various files according to 
associated codes, such as numerical codes to assist the user in downloading files using voice commands or keypad 
commands. 

Additionally, cellular phone or home telephone users may access a non-web holding unit with a library of stored 
files which can similarly be browsed, selected, and downloaded onto the phone using user voice commands, key pad 
commands, or by connection to a live operator. Such unit may be accessed by dialing a phone number (e.g, an 800 
number). Home telephones and cellular phones may have separate holding units, such as a satellite for cellular phones 
and a ground unit for home phones, or a satellite can be used by cellular phones to access a ground holding unit 

To facilitate selection of the files from such holding unit, the access system may provide for a code associated 
with each file which may be obtained by browsing the website as described above. As such, a user connected to the 



03O03235A1J_> 



) ) 

WO 03/003235 PCT/US02/20443 
holding unit would access the code associated with the file to select and download the file to the user's telephone. 

Many other methods allowing a user to select files from the holding unit are possible. For example, the telephone 
may include a voice recognition feature, wherein the user can say the name or part of the name of the song he wishes to 
select (e.g., "Strawberry Fields" or the name of the song artist). The phone may also utilize hierarchical submenus 
whereby the user may press dial keys with letters corresponding to a selection in a given category which ultimately leads 
to the selection of a particular song. A phone having a screen display for providing a text listing of the names of songs 
or categories, according to hierarchical submenus, may also be used for enabling the user to narrow down to a list of 
songs and/or artists from which he can make his final selection. 

A telephone may likewise be used to deliver files stored on the phone to a website, an e-mail address, another 
telephone, or other electronic device. Sound clips, which are segments of whole songs, musical compositions or other 
sound recordings, will be used mostly with telephones, however, downloading entire music or image files may also be 
done, subject to the security feature described above. Additional revenues may be generated as the consumer accesses 
the content library and uses airtime while browsing and downloading clips or entire songs from the library holding unit. 

Furthermore, cellular phone and home telephone service providers may offer extra features to phone subscribers 
which would allow the subscribers to download and store sound files for use with the telephone in accordance with the 
present invention. Such features by service providers may include a personal sound file storage box (which may be a 
file of clips and/or entire sound files) that the user can access via a personal code. The user may be charged a monthly 
fee for a subscription to the service, and/or per downloading of each song, whether or not the user is a subscriber. 
Additional revenue can be generated by the service provider even if the service is provided without a special charge 
since consumers will use more airtime. 
Telephone Using Sound Clips 

A telephone having stored sound clips which may comprise real music including human voice, various 
instrument sounds, and other sound effects may allow the user to select one sound clip or a rotation of several clips to 
"ring" the phone. Although it is preferable to ring the telephone with sound clips, an entire music file may also be used, 
whereby for example, a song may start playing and continue until the user picks up the telephone. (Of course, entire 
music files may be played on the telephone solely for the user's listening pleasure). The telephone may also be 
programmed to ring a conventional chime if the user so chooses. Such a telephone may utilize a storage chip carrying 
stored sound clips as well as the conventional phone chime programmed onto it 

Additionally, the telephone may allow the user to determine how many times a clip is repeatedly played for each 
ring, and the time delay between clips in a given ring. The user may also choose to mix different clips in one ring. A 
telephone may also include a looping feature which rings the telephone in a looped clip such that the clip plays 
repeatedly without a pause between repetitions of the clip, or a "cluster" feature which rings a "cluster clip" comprising 
a multiple number of clip segments from a single song, musical composition, or other sound recording played in 
sequence. 

Other features will allow the telephone user to preprogram the telephone to play a certain clip when a specific 
individual calls, thereby allowing the user to identify the caller based on the chosen sound clip. Each person who 
regularly calls the user may have a unique identifying ring. This will allow a telephone user to have the option of 
assigning a unique sound caller ID to each of an unlimited number of callers. Other features may include allowing a 
caller to select his own personal sound clip to "ring" the telephone of the recipient of the call. (For example, the caller 
may sing or record a "Happy Birthday" song.) Also, a telephone used by more than one user may utilize sound clips for 
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a callee ID function wherein the caller identifies the intended callee (e.g., by dialing a digit or sequence of digits) and 
the telephone plays the clip associated with the callee; 

Additionally, a telephone may be provided with a "caller message recorder feature" which allows the caller to 
record his/her own message to send to the number dialed. For example, the caller may send a message such as "Hey 
John. It's Mary. Pick up the phone," by pressing a 'Vecord ring" button on his/her phone to send such a message to . 
John's phone. As described earlier, the telephone user, say John, may have a caller ID feature such that when a certain - 
caller, say Mary calls, the telephone rings with a predetermined message or sound clip selected by John. The additional 
caller message recorder feature may cause John's telephone to play Mary's message instead of overlaying the 
predetermined caller ID message or clip. Additionally, John may record his own message such as "It's Mary" and 
associate that recorded message with Mary's phone number for a caller ID ring. 

A telephone, according to the present invention, may also include a "sensory feature" for enabling the telephone 
to sense the level of ambient noise and adjust the loudness of the "ring" accordingly. If the phone "senses" very loud 
background noise, for example, a cellular phone located in a loud restaurant, the ring volume will increase. A feature 
for detecting whether a cellular phone is located in a pocket book or a place where the "ring" sound may be muffled is 
also contemplated. This feature will also enable the phone to adjust the volume of the "ring" such that the "ring" will be 
loud enough for the user to hear. Such detection mechanism can be achieved by detecting ambient light and other 
conditions. 

Additionally, the user may manually adjust the volume of the ring via a tunable volume control mechanism ora 
multiple fixed settings control. (Although the above features are described with respect to a phone, it is to be understood 
that these features may also be provided with other electronic devices utilizing sound and/or image clips as alerts where 
applicable). 

Transmission System for Delivery Cli n s to a Telenhnn* 

FIG. 1 is a schematic diagram illustrating the basic components for a wireless transmission system 100 for a 
telephone 102, having a wireless or landline service provider. 

The system is described in terms of two main components: a storage chip 104, and a server 106. The storage 
chip 104 is an element associated with the telephone which may be embedded into the phone or into an accessory unit 
which attaches to the phone, having abilities to interface with the phone. The existing hardware of a cellular phone may 
also be integrated with a software system which may be downloaded to the RAM element of the cell phone for 
incorporating the present invention, without the need for extra hardware. As such, the existing hardware of the cellular 
phone may be made to perform the same function of the chip. 

The purpose of the chip 104 is to store a selection of clips, allow for downloading of clips to be stored on the 
chip 104, and allow for the playback of clips, either by the telephone or the chip 104. (Although the description herein 
refers to sound clips, it is to be noted that entire sound files may be stored, downloaded, and played, according to the 
system described). Additionally, the chip 104 can associate the stored clips with a caller ID so that the particular clip to 
be played back is determined by the calling subscriber ID. 

The server 106, which is associated with a collection of stored clip files 108, is designed to execute requests of 
the chip 104, which may be given through user voice commands or commands using the phone keys. The server may be 
equipped with a voice adapter 110 for supporting the ITU-T V.253 standard and telephone lines attached to the voice 
adapter. The voice adapter can also support some standard modem protocols, like V.32 or V.34, if required for 
compatibility. 
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The server 106 also allows for files to be transmitted to the chip 104 for storage. The system 100 enables a 
connection to the server 106 upon a request from the chip 104, utilizing the phone, and/or PSTN (Public Switched 
Telephone Network), and/or a voice card (voice modem) attached to the server computer. 

The system may have a voice menu, which, after connection to the server 106, allows the user to listen to the 

5 server's menu and navigate through the system of voice menus using the telephone's Dual Tone Multi Frequence 
(DTMF) keys. The system may allow the user to select and download clips by allowing the user to listen to the clips 
presented by the seiver 106, select a clip, and issue a download command to the server 106. The server then sends the 
selected clip (e.g. in digital compressed form) using a Custom Data Transmission Protocol (CDTP) over the voice 
channel. (Illustrated in FIGS. 2 and 12). 

io The system 100 allows for storage of a large number of clip files in the chip's memory. The system's server 106 

utilizes a music compression algorithm, shown in FIG. 15, which converts common music files into compressed files 
that are downloaded and stored by the chip 106. For example, a chip supporting the storage of about 1000 clips, each 
being approximately a few minutes in length, may have a flash memory size of about 40 Mbytes. The chip 104 may 
also have a sound output element such as speakers. 

is The server comprises software which can run under Windows 98, Windows-NT OS, or other suitable system 

using a voice modem for communications. Additionally, the system may use a single modem or a pool of several 
modems. 

Preferable Embodiments for a Telephone System 

Examples of telephone systems utilizing the method of the present invention include a cellular phone which may 

20 utilize an analogue (voice-only) system or a digital system, and a conventional land line telephony network. A system 
for using a cellular network infrastructure is shown in FIG. 2. A schematic diagram of a landline transmission system for 
a home telephone is shown in FIG. 12. (Again, although the following descriptions make reference to the use of sound 
clips, it is to be understood that entire sound files may also be used as described). 

All described examples assume existence of a server preferably dedicated for servicing user requests and 

25 providing sound clip data download capabilities. A corresponding chip, implementing all required functions is 
associated with the telephone. 

The server may be a computer running Microsoft Windows or other suitable environment, such as a Pentium-ni 
PC, Win95/98/NT/2000, 128Mb RAM, 4 GB HDD. The server may store or be capable of accessing a sound clip 
database, which may be stored on a website or non-web holding unit. The sound clip database is stored in a compressed 

30 file format of those commonly known. 

A schematic diagram for a server software system is shown in FIG. 4 for a cellular phone system, and FIG. 14 
for a landline system. The software may be written in C++ under Microsoft Windows or other suitable language. The 
functions of the server software include servicing user requests via a user interface element and transmitting a selected 
sound clip through the phone line via a music clip transmission element. 

35 According to a preferred embodiment, the functions of the user interface element include decoding DTMF keys 

pressed by the user and playing the voice menu labels to the user. The voice menu interface may include hierarchical 
submenus, leading to different functions. In all examples, the user interface element can be unified in the sense that the 
voice interface and DTMF or voice recognition-based interface are independent of the type of network or type of 
phone(s) used. Implementations that utilize a single server to process requests originating from different types of 

40 networks and/or phones can therefore be built. 
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The systern of submenus leads a user to the downloading of the selected sound clip. Thereafter, control is 
transferred to the music clip transmission element for downloading sound clips into the phone. The music clip 
transmission element interfaces directly with the phone accessory unit, independent of the user. The music clip 
transmission element is dependent on the type of the phone used and the network infrastructure. 
J Example Transmissio n System for a Cellular Phone and Network 

FIG. 2 is a schematic diagram of a wireless transmission system 200 for a cellular phone 202, which may be 
either an analogue (voice-only) or digital system. In both cases, a specialized board 203, implementing all required 
functions, similar to the chip 104, is incorporated in an accessory unit 204 attached to the cellular phone. Although the 
system 200 is described as incorporating an accessory unit, it should be understood that a chip performing the same 
functions of the board may instead be embedded in the phone itself, or a software system may be integrated with the 
existing hardware chip of a conventional cellular phone without the need for additional hardware. The system 200 
further includes a server 206 and software 207 for the server. 

The cellular telephone 202 may be any commercially available cellular phone having capabilities for supporting 
a command set for general telephone control, [i.e., a V.25Ter serial asynchronous automatic dialing and control as 
recommended by the ITU-T flnternational Telecommunication Union-Telecommunication sector)] and for supporting 
V.25Ter t, +C B extensions according to the ETSI (European Telecommunications Standards Institute) ETS-300-916 
standard for obtaining codes of keys pressed by the user and for receiving caller ID information. Additionally, the 
phone 202 should have capabilities for subscribing to a cellular provider 208 with caller ID service support. 

A schematic diagram of the board 203 is shown in FIG. 3 . In an embodiment where an accessory unit is used the 
board 203 is implemented in the accessory unit 204 which can be attached to the phone 202 through a standard ; 
extension connector where other commercially available accessories such as a hands free ear set and charging adapter ■ 
are typically connected. 

The board 203 includes the following main blocks: a Digital Signal Processor (DSP) 300, a flash memory 
element 302, a Random Access Memory (RAM) element 304, an initial bootstrap chip 306, an analogue interface 
element 308, and a digital interface element 310. 

The processor 300 executes the device firmware, provides control for all other blocks and performs the 
computational tasks for the board 203. The tasks performed by the processor 300 include control of the board's units, 
monitoring of keys pressed by the user and processing of key-press events, reception of information from the computer 
through the computer digital interface, reception of caller ID information through the phone digital interface, reception 
of packed sound clips through the phone analogue or digital interface, unpacking and then playing back sound clips 
through a built-in speaker connected to the analogue interface of the accessory unit 204, support of a voice menu-driven 
user interface, and performance of other auxiliary functions. 

The flash memory element 302 contains the device firmware, and sound clips which can be pre-loaded as well as 
downloaded from the server. The RAM element 304 enables the processor to run faster and also holds buffers for 
unpacked sound fragments and processor service procedures. When the power is turned on, the initial bootstrap chip 
306 loads the device. 

The analogue interface element 308 includes a phone interface element 312 and a built-in speaker interface 
element 314. The phone interface element 312 is used for input and output of signals when downloading sound clips 
from the server 206. The speaker interface element 314, with the speaker, plays all system sounds heard by the user 
including voice menus and sound clips. 
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The digital interface element 310 includes a phone interface element 316 and may include a computer interface 

element 318. The phone interface element 316 is used for phone control and for receiving key codes and caller ID 

information from the phone. The computer interface element 318 is used for various service functions such as 

downloading preprogrammed sound clips from the computer to the flash memory. 

The functions of the server software, shown in FIG. 4, include servicing a user's requests via a user interface 

element 402 and transmitting a selected sound clip through the phone line 404 via a music clip transmission element 

406. 

A user interface element 402 is provided whose functions include decoding DTMF keys pressed by the user and 
playing the voice menu labels to the user. The voice menu interface may include hierarchical submenus which lead to 
the downloading of the selected sound clip. Thereafter, control is transferred to the music clip transmission element 
406 for downloading sound clips into the phone. The music clip transmission element 406 interfaces directly with the 
phone accessory unit, independent of the user. 

The selected sound clip may be transmitted through the phone line to the accessory unit 204 first through the 
server hard drive 408, then through the server software 207, next through the voice adapter 210, then through the phone 
line of the network to the cellular service provider 208, to the cellular phone 202, and through the analogue interface 
308 of the accessory unit 204, then through the processor 300 of the accessory unit 204, and finally, through the flash 
memory element 302 of the accessory unit 204. When the sound clip transmission is completed, the task of the music 
clip transmission element is completed. Thereafter, the phone line 404 is released and control is transferred to the user 
interface element 402. 

In an autonomous mode, the board 203 may contain a number of pre-loaded sound clips. Initially, the board 203 
is in the inactive state. The board 203 and phone 202 interact such that the phone sends to the board codes of all the 
keys pressed by the user. Upon receiving a particular sequence of codes or when, for instance, a particular key is 
pressed for a prolonged period of time, the board 203 switches to the active mode. In the active mode the board 203 
may interact with the user via a voice menu-driven interface where voice messages, via a speaker, prompt the user to 
respond by pressing a selection of phone keys indicating the user's responsive selections. The board 203 reacts to the 
user's selections by analyzing the keys being pressed. 

The clips are stored on an internal clip index which can be retrieved from the internal memory and played back 
according to key commands provided by the user. Examples of voice menu options provided by the device 204 through 
a speaker upon switching to an active mode include: 1) the user may choose to exit the active mode and enter the 
passive mode (e.g., by pressing "0"); 2) the user may choose to listen to the current sound clip on the clip index (e.g., by 
pressing "1"); 3) the user may choose to listen to the next clip on the index (e.g., by pressing "2"); 4) the user may 
choose to listen to a previous clip on the index (e.g., by pressing "3"); or 5) the user may choose to assign a caller ID 
number to the current clip on the index (e.g., by pressing "4"). 

Upon choosing to assign a caller ID phone number to the current clip, the device may switch to a sub mode with 
a sub-menu having the following options: 1) the user may choose to switch back to the previous menu (e.g., by pressing 
"0"); 2) if the selected clip was already assigned, the user may choose to get information regarding the caller ID already 
associated with the clip (e.g., by pressing "1"); or 3) the user may input a new caller ID phone number for the current 
clip (e.g., by pressing "2" indicating this choice, then dialing in the phone number followed by the sign). 

In the passive mode, the device 204 may analyze messages being received from the telephone 202. Upon 
receiving an incoming call, the device 204 checks the incoming caller's phone number against the list of assigned caller 
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ID sound clips in its memory and plays back the particular sound clip through the built-in speaker if the specified 
caller's phone number was assigned to this clip. Alternatively, the device 204 may play back a default sound if the 
particular caller ID was not assigned to any clip. 

In order to transfer digitally compressed sound clip data through the analogue channel a special method and 
algorithm to map digits to sounds is used. This method is implemented not only for a cellular telephone using an 
analogue cellular network but also for a landline transmission system of a home phone, shown in FIG. 12. 
Method for Data Tra nsmission Over an Audio Channel of a Wireless Telephone 

A data transmission method 500 for transferring data through the phone line and the receiver, based on a voice 
mode connection (versus data mode) and DTMF signal interpretation is illustrated in FIG. 5. A similar approach can 
be implemented for a landline telephone that does not have a data transmission mode. 

For transmitting data through the phone line, the transmission method 500 comprises the steps of a) data 
scrambling 502, b) data mapping 504, c) conversion of frequency symbols to time samples 506, d) addition of cyclic 
prefix 508, and e) digital to analogue conversion 510. The data is then sent through the receiver, following the reverse 
steps off) analogue, to digital conversion 512, g) symbol synchronization 514, h) conversion of time samples to 
frequency symbols 516, i) decoding frequency symbols to bits 518, and j) descrambling the data 520. 

The transmission method is used to provide enough speed for the data transmission. The transmission method 
allows simultaneous use of the voice communication and data transmission features (during one connection session) 
without having to switch the mode of connection. A customer does not need to use a Wireless Internet Service 
Provider. A user can simply place a regular call to the specific number (e.g., an "800" number) to gain access to the 
Server. The dual-mode connection allows for voice and "push button" support as well as voice recognition service. 

An orthogonal frequency-division multiplex (OFDM) modulation scheme is used for data transmission. The 
benefits of OFDM include: 1) the modulation can be made robust to Inter-Symbol Interference (ISI) by increasing 
symbol size; 2) the modulation can be made robust to impulse noise by increasing symbol size; 3) for each individual 
sub-channel, the channel's response could be considered essentially flat, minimizing the need for channel equalization; 
and 4) different encoding schemes could be used for different sub-channels, for accommodating frequency-selective 
channel distortions. 

The total bandwidth to be used by the method is determined by the worst case of supported audio channel. A 
suitable algorithm for compression of the voice channel is the GSM RPE-LTP algorithm which essentially has a built-in 
down-sampling by a factor of 3 in which the allowed bandwidth is limited at 4000/3=1333Hz. Usually channel 
response is severely limited at frequencies below about 200Hz to 250Hz. No OFDM symbol time-windowing is 
employed to minimize variations of transmitted signal amplitude envelope. 

A compressed voice channel can also introduce significant non-linear distortions. Therefore, it is not feasible to 
have a large number of sub-channels; otherwise the algorithm would be affected by significant inter-channel 
interference (ICI) due to loss of orthogonality between sub-channels. About 32 sub-Channels appear to provide enough 
symbol size while maintaining satisfactory low ICI. 
Modulation Symbol Structure 

Each OFDM symbol consists of a minimum number of samples sufficient to represent all sub-channels. To 
increase computation efficiency, a Fast Fourier Transform is employed to convert sub-channel symbols from frequency 
to time area: Therefore, for 32 sub-channels, OFDM symbol size should be at least 64 real samples (at 2666Hz rate). A 
circular prefix of 16 samples is used to improve separation between symbols, and minimize ISI (Inter Symbol 
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Interference) and ICI. Therefore, total symbol size is 80 samples at 2666Hz. 
Receiver Synchronization 

Circular extension prefix redundancy, present in the signal, is used to facilitate OFDM symbol synchronization 
in the receiver. A synchronization subsystem effectively computes auto-correlation coefficients of the received 
sequence (e.g., at 2666Hz). The output of the correlator goes through a "rectifying" phase-locked loop-like system 
which outputs synchronization impulses at the proper time instants to sample OFDM symbols correctly. 

Synchronization system induced timing jitter leads to rotation of received sub-channel phasors by increments, 
proportional to the central frequency of a particular sub-channel. This rotation is compensated in the decision scheme. 
Data Mapp ing * 

The output of a scrambler is mapped onto complex symbols (amplitude/ phase) of the OFDM sub-channels. 
Individual sub-channels use QPSK (Quadrature Phase Shift Keying) modulation. 
Data Scrambling 

Data scrambling is employed in order to provide statistically random distribution of transmitted symbols to 
reduce peak-to-average power ratio of OFDM symbols. A self-synchronizing scrambler with generating polynomial of 
l+x'^+x" 23 is used which, at the transmitter, effectively divides the data sequence by the generating polynomial. The 
coefficients of the quotients, taken in descending order, form the output data sequence. 
Example Using analogue Cellular Network and Cellular Telephone 

The above-described accessory unit 204 is provided in this example in the context of analogue (providing only 
voice channel) cellular network. . 

Initially, the device 204 is in an inactive mode, A user dials the server number and, navigating through a system 
of voice menus supported by the server software, listens to and selects a particular sound clip in the same way as 
browsing the loaded sound clips in the autonomous mode. Instead of assigning a caller ID, the user may choose to 
download sound clips. 

When a user, navigating through the server voice menus, selects to download the current clip and in the 
embodiment using an accessory unit activates the accessory unit 204 through the predefined key sequence, the process 
of transmission of the selected sound clip is initiated. After selecting a "download" option, the user may press a specific 
key combination on the phone to switch the accessory unit 204 from the inactive to the active mode. The unit 204 then 
begins to interact with the server 206, using the analogue channel provided by the phone and network. The already 
established phone connection is used to receive information. The device may receive the sound clip selected by the user 
and download it into internal flash memory. 

At the end of a session, the unit 204 forces the telephone 202 to hang up and switches to the autonomous mode 
which enables the user to assign a new caller ID to the sound clip just received. When a user, navigating through the 
server voice menus, selects to download a clip and activates the accessory unit 204 through the pre-defined key 
sequence, the process for transmission of the selected sound clip is initiated. 

The selected sound clip is transmitted through the phone line to the accessory unit 204 first through the server 
hard drive 408, then through the server software 207, next through the voice adapter 210, then through the phone line of 
the network to the cellular service provider 208, to the cellular phone 202, and through the analogue interface 308 of the 
accessory unit 204, then through the processor 300 of the accessory unit 204, and finally, through the flash memory 
element 302 of the accessory unit 204. 

Generally, the server software 207 retrieves the selected sound clip from a database 212, converts it to the 
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special sequence of sounds modulates, transfers codes ofthese sounds to the voice adapter 210 that converts these codes 
to actual sounds and transfers these sounds to the phone line 214. From the phone line 214, the sounds go to a cellular 
provider 208 through to a radio channel, and to the cellular phone 202 itself (much like voice sounds are transferred 
during a normal phone conversation). The sounds then go through the connector and are received in analogue form by 
the board 203. The sounds are then converted by the device ADC (Analog to Digital Converter) to the digital form and 
are processed by the DSP (digital signal processor -"demodulated") 300 to thesame digital data form initially stored on 
the database 212 (e.g., in MPEG audio format). In this form, the sound clip data are written into the flash memory 302 
of the device 204. 

Following a reverse direction, going from the board 203 to the server 206 using the same chain, the device sends 
to the server either an "acknowledgement" of a successful delivery of the sound clip data or a list of data blocks 
received with errors so that these blocks can be resent in a second try. In order to transmit digital data through the 
analogue channel, a similar procedure is used to convert data to sounds and back. 

When all the data is transferred without errors, the board 203 signals to the server 206 that the call may be 
disconnected. Thereafter, the server 206 instructs the voice adapter 210 to hang up, freeing the phone line for another 
client, and the board 203 switches to the autonomous mode, allowing the user to assign a caller ID to the sound clip 
most recently downloaded. 

Example Using digital C ellular Network and Cellular Telephone 

The above-described accessory unit 204 is used in this example in the context of digital (capable of providing a 
dedicated data transmission channel) cellular network. Since in this case a digital channel is used for sound clip data 
transmission, no modulation is required on the mobile phone side. The server, on the other hand, uses a modulation 
protocol compatible with the protocol supported by the cellular network provider. Usually this can be accomplished by 
using a standard ITU-T-approved modem, like V.32 or V.34. 

Initially, the unit 204 is in an inactive mode. A user dials the server number and, navigating through a system of 
voice menus supported by the server software, listens to and selects aparticular sound clip in the same way as browsing 
the loaded sound clips in the autonomous mode. Instead of assigning a caller ID, the user may choose to download 
sound clips. 

When a user, navigating through the server voice menus, selects to download the current clip and activates the 
accessory unit 204 through the pre-defined key sequence, the process of transmission of the selected sound clip is 
initiated. After selecting a "download" option, the user may press a specific key combination on the phone to switch the 
accessory unit 204 from the inactive to the active mode. The unit 204 then begins to interact with the server 206, using 
the digital channel provided by the phone and the network. If possible, the already established phone connection is used, 
or a new connection is established specifically for digital data transmission. 

The selected sound clip is transmitted through the phone line to the accessory unit 204 first through the server 
hard drive 408, then through the server software 207, next through the voice adapter-modem 210, then through the 
phone line of the network to the' cellular service provider 208, to the cellular phone 202, and through the digital 
interface of the accessory unit 204, then through the processor 300 of the accessory unit 204, and finally, through the 
flash memory element 302 of the accessory unit 204. 

Generally, the server software 207 retrieves the selected sound clip from a database 212, transfers codes ofthese 
sounds to the voice adapter-modem that converts these codes to actual sounds, using one of the standard modulation 
protocols supported by the cellular provider (like ITU-T V.32 or V.34) and transfers these sounds to the phone line 214. 
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From the phone line 214, the sounds go to a cellular provider 208, where they are demodulated back into digital data 
and then the data goes to the cellular phone 202, through the radio channel, using the digital .channel provided by the 
cellular network. The data is then received by the processor of the accessory unit, and then written into the flash 
memory 302 of the device 204. 

Following a reverse direction, going from the board 203 to the server 206 using the same chain, the device sends 
the server either an "acknowledgement" of a successful delivery of the sound clip data or a list of data blocks received 
with errors so that these blocks can be resent in a second try. When all the data is transferred without errors, the board 
203 signals to the server 206 that the call may be disconnected. Thereafter, the server 206 instructs the voice adapter- 
modem to hang up, freeing the phone line for another client, and the board 203 switches to the autonomous mode, 
allowing the user to as sign a caller ID to the sound clip most recently downloaded. 

In order to provide a guaranteed and error-free delivery of digitally compressed sound clip data through the data 
channel provided by the phone and network, a special error detection and correction method is proposed. 
A Data Transmission Method with Error Correction Delivery 

A method for data transmission with error correction assumes a sufficiently low probability of error in the 
channel and implements error correction by re-sending the affected data blocks. The data (i.e. the compressed sound 
clip) is split into smaller data blocks by the server. Each block is supplied with a special header that, in particular, 
includes the block number and a cyclic redundancy code word for error detection, computed for the block data and 
header. Other error detection codes can also be utilized. The data blocks are then sent through the data channel 
sequentially. Using the redundancy code, the receiver (i.e. the mobile device) checks the correctness of each received 
block. The size of blocks is selected in such a way that 1) there is a high probability of error-free transmission of a 
block; and 2) the overhead introduced by additional control information (e.g. header, CRC word) is not high compared 
to the data payload. 

If the block size is selected properly, only a few blocks out of the entire sequence are usually affected by channel 
errors. These erroneous data blocks are re-sent by the Server upon receiving special requests. 

Depending on the availability of bi-directional data transfer, one of two protocols can be used. If the phone and 
network support simultaneous transmission of data in both directions, a protocol that uses simultaneous transmission of 
data in a server-to-phone direction and acknowledgements in a phone-to-server direction can be utilized. In this case, a 
special acknowledgement packet is sent for each valid data block received by the device. If a block is received with an 
error, a negative acknowledgement packet is sent. 

The sever software, receiving these control packets, either sends the next subsequent data block, until all data 
blocks are transferred, or resends the block received with error. When all data blocks are transferred, and the positive 
acknowledgement is received for the last block, the sound clip is considered to be completely delivered. An example of 
such interaction is shown in FIG. 16 for a protocol with individual packet acknowledgement for full-duplex channel, 
showing three data blocks 1600, 1602, and 1604 for corresponding data, indicated in the figure as e T>atal" "Data2" and 
"Data3", with corresponding headers "Hdrl", <c Hdr2", and 'HdrS". Positive acknowledgements "Ackl" and Ack2" are 
sent for packets numbers 1 and 2. Packet number 3 is originally received with an error, indicated by "Nack" and is 
subsequently re-sent to successfully correct the error, whereby a positive acknowledgement "Ack3" is Sent 

If the phone or network supports only unidirectional data transmission, the other protocol can be utilized to 
minimize the number of channel direction alterations. In this case, all data blocks for the sound clip are sent at once by 
the server, without receiving acknowledgements for the individual packets. Then, a single control packet is transferred 
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in the opposite (device to server) direction. This control packet contains a bit mask, with one bit for each data block 
received. Each bit in the bit mask has a u l" value if the corresponding data block was received without errors, or a "0" 
value if the corresponding block was affected by errors. The server then re-sends those blocks that were received with 
errors in the first pass. When all data blocks are transferred, and the acknowledgement mask without errors indication is 
received by the server, the sound clip is considered to be completely delivered. Example of such interaction is shown in 
FIG. 17, where the packet number 2 is originally received with an error and is successfully re-sent subsequently to 
correct the error. Note that only two "ACK" packets were sent during the entire procedure. 
Example Using Landl ine Telephony Network and a Conventional Home Telephone 

FIG. 12 illustrates an example of a landline telephony network system 1200 using a voice channel and a 
conventional home telephone 1201, according to a preferred embodiment (Although this system is described with 
respect to the accessing and delivery of sound clip files, it is to be understood that the system may be used for the 
accessing and delivery of entire files). 

The system 1200 has two main components: a home telephone accessory unit 1202 and a server 1205. The 
accessory unit 1202 is an autonomous unit, attached to the phone line 1208 and to the phone (between the line and the 
phone), and powered from the AC power outlet. A chip performing the same function of the accessory unit may instead 
be embedded in the phone itself. The purpose of the accessory unit 1202 is to support selection, downloading, and 
playback of sound clips according to the Caller ID of the calling subscriber. The accessory unit 1202 may include a 
speaker system and enables the phone to ring sound clips or perform other functions as described for the cellular phone 
102. 

The server 1205, which is associated with stored clip files 1206, which may be stored on a website or a non web 
holding unit, is designed to execute requests of the accessory unit 1202 either through user voice commands or 
commands using the phone keys, and allows for files to be transmitted to the accessory unit 1202 for storage. The 
system 1200 enables a connection to the server upon a request from the accessory unit 1202, utilizing the phone and 
PSTN (Public Switched Telephone Network), and an adapter 1204 (voice modem) attached to the server computer. 
The system 1200 may have a voice menu, which, after connection to the server, allows the user to listen to the server's 
menu and navigate through the system of voice menus using the phone's DTMF keys. The system may allow the user to 
select and download clips by allowing the user to listen to the clips presented by the server, select a clip, and issue a 
download command to the server. The server then sends the selected clip (e.g., in digital compressed form) using the 
Custom Data Transmission Protocol (CDTP) over the voice channel. 

The device 1204 may further interface with a home PC 1214 for downloading sound files to the device. The 
interface may be a plug in connection or may use a wireless network system. 

The accessory unit 1204 may be sold as a unit compatible to most home phones including cordless phones, and 
may connect directly to the phone jack, with the phone connected to the device. Similar to the cellular phone 102, a 
home phone may include an embedded chip, instead of the accessory unit 1204, for performing functions similar to 
those of the accessory unit 1204. 

The handset of a cordless phone utilizing sound clips according to the present invention may ring simultaneously 
with the box, wherein the handset may sound a regular phone ring or a sound clip ring, while the box plays a sound clip 
ring. 

The server comprises software shown in FIG. 14, which can run under Windows 98®, Windows-NT OS®, or 
other suitable system using a voice modem for communications. Additionally, the system may use a single modem or a 
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pool of several modems. 

Initially, the accessory unit 1202 is in an inactive mode. A user dials the server number and, navigating through 
a system of voice menus supported by the server software, listens to and selects a particular sound clip in the same way 
as browsing the loaded sound clips in the autonomous mode. Instead of assigning a caller ID, the user may choose to 
download sound clips. 

When a user, navigating through the server voice menus, selects to download the current clip and activates the 
accessory unit 1202 through the pre-defined key sequence, the process of transmission of the selected sound clip is 
initiated. After selecting a "download" option, the user may press a specific key combination on the phone to switch the 
accessory unit 1202 from the inactive to the active mode. The device 1202 then begins to interact with the server 1205, 
using the analogue channel provided by the telephone and the network. The already established phone connection is 
used to receive information. 

The selected sound clip is transmitted through the telephone line to the accessory unit 1202 first through server 
1205, next through the adapter 1204, then through the telephone line of the PSTN to and through the analogue interface 
1305 of the accessory unit (shown in FIG. 13), then through the processor 1301 of the accessory unit 1202, and finally, 
through the flash memory element 1302 of the accessory unit 1202. 

A schematic diagram of a board 1300 implemented in the accessory unit 1202 is shown in FIG. 1 3 . The board 
includes the following main blocks: a processor element 1301 [e.g., a Digital Signal Processor (DSP)], a flash memory 
element 1302, a Random Access Memory (RAM) element 1303, a bootstrap chip 1304, an analogue interface element 
1305, and a digital interface element 1306. 

The processor 1301 executes the device firmware, provides control for all other blocks and performs the 
computational tasks for the board. The tasks performed by the processor 1301 include: control of the board's units, 
monitoring of keys pressed by the user and processing of key-press events, reception of information from the computer 
through the computer digital interface, reception of caller ID information from telephony service provider, reception of 
sound clips through the phone analogue interface, unpacking and then playing back sound clips through a built-in 
speaker connected to the analogue interface of the accessory unit 1202, support of a voice menu-driven user interface, 
and performance of other auxiliary functions. 

The flash memory element 1302 contains the device firmware, and the sound clips which can be pre-loaded as 
well as downloaded from the server. The RAM element 1303 enables the processor to run faster and also holds buffers 
for unpacked sound fragments and processor service procedures. When the power is turned on, the bootstrap chip 1304 
loads the device. 

The Analogue Interface element 1305 includes a telephone interface element and a built-in speaker interface 
element The telephone interface element is used for input and output of signals when downloading sound clips from 
the server 1205. The speaker interface element with the speakers, plays all system sounds heard by the user including 
voice menus and sound clips. 

The digital interface element 1306 may include a computer interface element and other digital interface elements 
to the home network. The computer interface element may be used for various service functions such as downloading 
preprogrammed sound clips from the computer to the Flash Memory. 

In an autonomous mode, the accessory unit 1202 contains a number of pre-loaded sound clips. Initially, the 
accessory unit is in the inactive state. The accessory unit 1202 and telephone 1201 interact such that the telephone 1201 
sends to the accessory unit 1202; codes of all the keys pressed by the user. Upon receiving a particular sequence of 
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codes or when, for instance, a particular key is pressed for a prolonged period of time, the accessory unit 1202 switches 
to the active mode. In the active mode, the accessory unit 1202 may interact with the user via a voice menu-driven 
interface where voice messages, via a speaker, prompt the user to respond by pressing a selection of phone keys 
indicating the user's responsive selections. The accessory unit 1202 reacts to the user's selections by analyzing the keys 
being pressed. 

In the passive mode, the accessory unit 1202 may analyze messages being received from the telephone 1201. 
Upon receiving an incoming call, the accessory unit 1202 checks the incoming caller's phone number against the list of 
assigned caller ID sound clips in its memory and plays back the sound clip through the built-in speakers if the specified 
caller's phone number was assigned to this clip. Alternatively, the accessory unit 1202 may play back a default sound 
if the particular Caller ID was not assigned to any clip. 

A schematic diagram of the server software is shown in FIG. 14. The server software is used for servicing user 
requests through user interface element 1401 , and transmitting the selected sound clip through the phone line via music 
clip transmission element 1402. The user interface element 1401 decodes DTMF keys pressed by the user, and plays 
voice menu labels to the user. The voice menu interface includes hierarchical submenus to lead the user to the 
downloading of the desired sound clip, where control is transferred to the music clip transmission element 1401. 

The music clip transmission element 1401 downloads sound clips to the phone/independent of the user interface 
element, interfacing directly with the phone accessory unit. The music clip transmission element 1402 initially 
transmits the selected sound clip to the adapter 1404 for data transmission from the server to the accessory unit. When 
the sound clip transmission is completed, the task of the music clip transmission element is done, and the telephone line 
is released and control is transferred to the user interface element 1401 

Generally, the server software retrieves the selected sound clip from a server database 1403, which is associated 
with an audio data optimization and compression element 1405, converts the clip to the special sequence of sounds 
modulates, and transfers codes of these sounds to the adapter 1404 which converts these codes to actual sounds and 
transfers these sounds to the phone line 1406. From the phone line 1406, the sounds go through the PSTN and are 
received in analogue form by the accessory unit 1202. The sounds are then converted by the device ADC (Analog to 
Digital Converter) to the digital form and are processed by the DSP (digital signal processor) 1301 to the same digital 
data form initially stored in the server database 1403 (e.g., in MPEG audio format). In this form, the sound clip data are 
written into the flash memory 1302 of the accessory unit 1202. 

Following a reverse direction, going from the accessory unit 1202 to the server 1205 using the same chain, the 
device sends to the server either an "acknowledgement" of a successful delivery of the sound clip data or a list of data 
blocks received with errors so that these blocks can be resent in a second try. In order to transmit digital data through 
the analogue channel, a similar procedure is used to convert data to sounds and back. When all the data is transferred 
without errors, the accessory unit 1202 signals to the server 1205 that the call may be disconnected. Thereafter, the 
server 1205 instructs the adapter 1204 to hang up, freeing the phone line for another client, and the accessory unit 1202 
switches to the autonomous mode, allowing the user to assign a Caller ID to the sound clip most recently downloaded. 

The server audio data optimization and compression element 1205, utilizes a music compression algorithm 
outlined in FIG. 15, which converts common music files into compressed files in order to reduce the audio clip size for 
minimizing its download time, while maintaining predetermined audio quality. These files are downloaded and stored 
by the accessory unit 1202. 

Preferred Procedure for Audio Data Parametric Optimization and Compression 
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The method 1500 of compressing the files comprises the steps of a) conversion 1502; b) amplitude normalization 
1504; c) sample rate conversion 1506; d) pre-emphasis filtering 1508; e) amplitude normalization 1510; and f) 
performance of MPEG audio layer 3 (MP3) compression with the selected parameters 1512. The compressed files are 
then transferred to the server database. 

5 Step 1502 of conversion to mono only needs to be performed if the input file is in stereo and if the audio output 

subsystem of the target hardware is only capable of playing back mono audio. At this step all available information is 
included into the output audio by summing of the left and right channels to form a single mono output. 

After conversion, or if the file does not need to be converted to mono, compression begins with the step 1504 of 
amplitude normalization, wherein sample amplitudes in the file are normalized. This step is required for enabling audio 

10 utilization of all available dynamic range and for improving the computational accuracy of subsequent steps. In order to 
maximize preservation of original audio range, a fixed coefficient for the entire audio file normalization is used. The 
coefficient is obtained using input file analysis to "stretch" the input audio range over the maximum available range. 

Step 1506 converts sample rate of audio files to selected sampling frequency. The original audio clips may have 
various sampling rates (44100Hz, 48000Hz, 22050Hz, 1 1025Hz, etc.). After analysis of available hardware capabilities 

is an optimal sampling frequency, which provides the most adequate audio quality, is selected. Increasing the sampling 
frequency above the optimal sampling frequency would not significantly increase the perceptual audio quality, due to 
the limitations of the audio output subsystem of the accessory unit For example, for the cellular phone system of FIG. 
2, after analysis of available hardware capabilities and a series of perceptual tests, the 22050Hz sampling frequency was 
selected as providing the most adequate audio quality since the audio output subsystem of the accessory unit has a 

20 relatively sharp drop in response for frequencies above 10-12 kHz. 

In order to avoid aliasing effects when changing from higher to lower sampling rate, a low-pass pre-filtering with 
a cutoff slightly lower than the newNyquist frequency is applied before down sampling. For rates that are not multiples 
of each other, cascaded sampling rate conversion schemes are constructed to minimize memory consumption and 
improve performance. 

25 The step 1508 of pre-emphasis filtering, along with the re-sampling of the previous stage, takes into account the 

specifics of the audio output subsystem of the accessory unit, to achieve improvement of the perceptual audio quality, 
and to reduce the resulting audio size after compression. 

Since the speaker of the audio output subsystem of accessory unit is preferably very small, the resulting sound 
has very low power in the low frequency range. Therefore, providing output in the low frequency range is likely to be 
30 futile, as it would only increase the size of audio file without any perceptual improvements. Additionally, providing 
output in the low frequency range may create undesirable "overflow" effects for the speaker. 

For example, for the cellular phone system of FIG. 2, a! I frequency content below about 400Hz is removed from 
the audio. In order to make the audio more "perceptually rich" in the low-frequency range, frequencies around 600Hz 
are increased by about +6dB. The frequency range from 1200Hz to 8200Hz is kept unchanged. Then, starting from 
35 about 8200Hz the signal power is gradually increased, up to +1 5dB at the highest frequency (1 1kHz). This compensates 
for the drop in speaker transfer function at high frequencies and improves the listening experience. 

A set of subjective audio perceptual tests with various types of audio contents, using the wide spectrum of 
hardware of the target platform has proved that the above-described pre-emphasis significantly improves the perceptual 
quality of resulting audio. At the same time, reducing frequency contents in the ''non-significant" frequency regions 
4o allows reduction of the resulting compressed audio size, since the data bits are not allocated to non-used frequencies. 
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The described pre-emphasis procedure is implemented by a filtering with a FIR (Finite Impulse Response) filter, 
according to the formula: 



N-l 

where ^ are filter coefficients, 
x 

k is the k-th output audio sample, 
is the k-th output audio sample. 

The coefficients are fixed and computed for the particular sampling rate and the desired pre-emphasis 

response curve. The filter can be designed to have a linear phase response (this is actually guaranteed if the 
coefficients are symmetric), which would ensure absence of phase distortions to the audio. Since the delay introduced 
by the filter is not harmful for off-line processing, the filter size can be made rather large to approximate the desired 
response curve with a high precision. 

After completing the step of pre-emphasis filtering, normalization of the sample amplitude is once again 
performed. Since the filtering significantly changes the signal, the second amplitude normalization step 1510 is : 
required to convert resulting audio "loudness" to some pre-defined value. 

Proceeding to step 1512, the processed audio clip is compressed into an MPEG Layer 3 bit stream. The resulting 
bit rate (level of compression) can be varied to suit different needs. For instance, it can be made dependent on the 
source audio clip length, to make the compressed file fit into a pre-defined size. Alternatively, it can be made dependent 
on the anticipated delivery method (to create, for instance, a "buiJt-in" audio clip of a very good quality, or to make the 
audio clip of a very small size, for delivery through a slow channel). The compression parameters can also be selected 
so that the clip delivery time is a consiant independent of the actual link transfer rate. 
Technical Description of a Preferred Embodiment for a Cellular Phone Accessory Unit 
Electrical Schematics 

FIGS. 6 A-D illustrate the electrical schematics of a mobile phone accessory unit (The image of the printed 
circuit board, as rendered by Computer Aid Design Software is shown in FIG. 7). Initial boot-up of the processor is 
done from the EEPROM (Electrically Erasable Programmable Read Only Memory) using passive serial SPI (Serial 
Programming Interface) protocol. Thereafter, the boot loader code, read from the EEPROM, loads the main firmware 
from the Flash memory. The PLL (Phase Locked Loop) of the processor is programmed for 5x multiplication of clock 
frequency. 

Firmware debugging is carried out through the JTAG (Joint Test Action Group) port using standard TI (Texas 
Instruments) software. External RAM is mapped both to the program and data space at the same addresses and 
occupies all lower address space (64k). Flash memory (Serial Data Flash) is accessed using software emulation of SPI 
protocol. 

Audio code (Coder-decoder) works at approx. 22kHz sampling rate (both channels). The Mode Control 
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transistor selects the phone interface mode: either RS-232 control mode (closed state) or "hands free" mode (opened 
state). The phone itself does not support simultaneous usage of these two modes. 

Both channels of RS-232 work at 11,5200 baud rate. From the PC side CTS (Clear to Send) and DTR (Data 
Terminal Ready) signals are supported. From the phone side CTS and RTS (Request to Send) signals are supported, 
"with inverted polarities. Both channels employ hardware flow control. 

The analogue signal level at the phone input is about lOOmV RMS (Root Mean Square). The level at the phone 
output is about 600mV RMS. 
Cellular Phone with Accessory Unit 

A cellular phone 900 with an accessory unit 902, according to one embodiment, is shown in FIGS. 8 and 9 using 
the Ericsson R520 as an example. The accessory unit is housed in thin cover 904 (see FIGS. 1 0 and 1 1) providing a 
mounting body 906 for attaching the phone 900 to the unit, via an interface connector 800. The accessory unit contains 
the printed circuit board 802 and speaker, preferably along the thin portion of the body. 

The accessory unit is attached to the back of the phone using the phone connector 800. A snap mounting which 
utilizes a dimpled section on the phone case typically intended for a car phone holder may also be used. 

The accessory unit includes the server software and two voice modems, attached to the server. The mounting 
body contains all necessary electronic components. 

The tasks of the electronic components include playing back of a pre-loaded sound clip upon a caller ID 
notification reception and downloading new clips from the server. 

The body of the accessory unit preferably comprises a base 804, a thin cover 806, and a molded cover 808. All 
three body components are preferably made of high-quality aluminum-magnesium-copper alloy (duralumin) and are 
chemically covered with a protective oxide film using two different dyes (colored and black) for the two copies of the 
device. A dense rubber casing may also be used. Factors considered in selecting the body material include lightness (so 
that the accessory unit would not exceed the phone itself in weight), mechanical strength, and the quality of 
electromagnetic shielding properties for protecting the internal components from the waves radiated by the phone. 

According to one embodiment, the accessory unit that embodies the delivery system for a cellular phone 
attachment is about 1.5" x 1.5" x .25" and includes a small high fidelity built-in speaker. The accessory unit may 
connect into the AC adapter fixture in the bottom of a cellular phone. A dense rubber casing or glove may house the 
device to protect it. The inside of the rubber glove may have a molded cavity that the device will fit into. The glove 
may have a circle of small holes which line up with the device's speaker to allow full sound penetration. The device and 
glove may be sold in different design variations both for marketing purposes and for fitting the different cellular phones 
on the market. 

Additionally, the accessory unit may connect into a cable connector instead of an AC adapter jack so that 
different jacks can be used. The device may also be modified with four or five variations to fit the various cellular 
phone software systems, (not AC adapter variations) currently on the market The device, including a speaker, may also 
be made with several different adapter applications that would attach to a variety of different phones. 

A snap-on mount for fastening the accessory unit to the phone may be located in the base body. The main 
purpose of the thin cover is to provide the electromagnetic shielding. The molded cover 808 contains connectors and 
some other components. 

A simple.snap mechanism for attaching the accessory unit to the phone, similar to the installation of a cellular 
phone to a car phone holder, may be provided. 
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FIG. 20 shows the accessory unit by itself, detached from the phone. The accessory unit can be detached from 
the phone similar to the detachment of other accessories such as the phone charger (usually by applying a rotating force 
rather than pulling straight out). 

FIG. 21 shows the accessory unit uncovered.. It is preferable to leave the body of the accessory unit closed. 
Preferably, there are no glued, soldered, or other permanent junctions inside, however, the high precision in the. 
manufacture of some components could lead to their degradation after repeated assembly and disassembly. 

The accessory-to-phone mounting is preferably designed to withstand repeated attachment and detachment 
without degradation of the snap-on mounting or connector. Although the phone body is also durable, it is preferable to 
attach and detach the accessory by shifting the snap-on lock upward manually during the attachment procedure (like 
during detachment) to reduce wear of phone body near the latch. 

The PCB (printed circuit board), located inside the accessory unit, is a muJti-Jayer board which may have 0.2mm 
gaps, two solder mask layers, and a silkscreen layer. The board preferably carries all the components, as illustrated in 
the schematics, excluding connectors and the speaker. Two outer layers of the board are signal layers; two internal 
layers are ground and 3.3 V power plane. For convenience of the PCB assembly on modern plants, most packages are 
surface-mounted but not BGA. The board preferably does not contain any components requiring rare or custom-made 
equipment for their assembly. 

The phone connector is preferably selected to maximize the firmness of the attachment, taking into account 
significant dimensions of the accessory unit. It should be mentioned that the connectors are unique to the type of the 
phone used (Ericsson R520 and compatible, like R320 and T28, in this example). 

Factors in selection of the speaker for music playback included sound quality, which is primarily related to the- 
speaker size, compactness, and weight of the speaker, as it is desirable that the speaker not be thicker and heavier than, 
the phone itself. Depending on the available technology, there may be some tradeoff between good speaker quality and 
having a lightweight speaker. Speakers used in professional radio receivers-scanners may be a reasonable compromise 
since such speakers provide better than usual sound quality while possessing reasonable dimensions and weight Other 
options include either sacrificing weight and dimensions to increase sound quality or using the new so-called "ceramic" 
speakers that are now appearing on the market. Mention should be made that although using these speakers could 
provide better quality, special modifications to the device would be required since these speakers could not be directly 
substituted in place of the standard ones. 

It should also be noted that the bandwidth of the acoustic channel of the cellular phone which, in turn, is non- 
linearly compressed and transmitted over a digital channel of the phone, is much less than the bandwidth of the 
conventional landline phone and can deliver about 150 bytes per seconds data transfer rate. Conventional landline 
phone could deliver about 3700 bytes per seconds (V.34). Using better speakers in the phone would entail loading 
sound fragments of better quality (and, therefore, of bigger size), which would increase the time necessary to download 
a melody. The problem could be solved by using the GSM digital data channel directly which would provide a rate of 
about 1000 bytes per second for existing cellular networks and more than 7000 bytes per second for newly deployed 
systems. Alternative solutions include: having to tolerate an increase in the sound file or sound clip download time, 
downloading a melody from a local computer (the melody being delivered to the local computer by some alternate 
means), and redesigning the system to support conventional (landline) phones. In the latter case, due to the significant 
increase in the device body size, it may make more sense to use a stereo-effect (which is reasonable when the speakers 
of left and right channels have enough spatial separation). 
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Server Software Description 

The server described herein performs the following functions: 1) startup, detection of the modem, detection of 
the melodies available; 2) answering incoming calls; and 3) servicing requests of user via DTMF codes. 

Upon startup, the application requests the user to select which device to work with. Possible options include 
5 local test mode (0), modem on COM1 port (1), and modem on COM2 port (2). 

If the local test mode is selected, all sounds will be played back using the sound card of the local computer and 
the computer keyboard will be used to control the server (via numeric buttons instead of DTMF keyboard). This mode 
is primarily for system testing purposes. 

If one of the modems is selected, all sounds will be played back into the phone line using the selected modem, 
10 and the calling party's phone keyboard (DTMF tones) will be used to control the server. This is the normal mode of 
server operation. 

The answering of incoming calls is performed differently in the local and the normal modes. In the local mode, 
the application waits for the 6 R' key to be pressed to simulate remote party RING, while in the normal mode, the 
application waits for the RING signal from the modem. Then, in either mode, the application initializes the device used 

is (sound card or modem). In the latter case, the modem goes "off-hook" and plays back the greeting message and the 
main menu (e.g., 0 - End of the sessipn, 1 -Current, 3 - Next, 4 - Load). 

Thereafter, the application goes into calling party servicing loop. Exit from the loop is possible upon reception 
of DTMF code '0 9 (or its simulation using the keyboard) or after the 30-seconds timeout if no reaction is detected from 
the remote user. Additionally, if working with the modem, the loop is exited when short beeps ("BUSY") condition is 

20 detected on the phone line. In the local mode, the 'X' key also leads to the immediate exit of the application. 

The calling party servicing algorithm may work as follows: the software keeps the internal counter or number of 
the current sound clip. Initially, this number is "0" indicating that the clip is at the top of the list. Upon receiving the 
"1" command, the software plays back the clip with the current number. Upon receiving the "2" command, the software 
increases the number and plays back the melody, i.e., plays the next melody. Upon receiving the "3" command, the 

25 software decreases the number and plays back the melody, i.e., plays the previous melody. Upon receiving the "4" 
command, the melody download is simulated. For the obvious reason, this mode is not implemented yet. Upon 
receiving the "0" command, the application switches the modem "on-hook" releasing the phone line and returns to the 
incoming call waiting state. Upon encountering any other command, the application plays back a standard error 
message. At any moment, the server application can be aborted by pressing <Ctrl>C combination on the keyboard. 

30 The server application keeps a log file (e.g., named 'TrograrnName_N.Log") where N is a port number. 

Therefore, if two instances of the application are started, one for the modem on COM1 and the other for the modem on 
COM2, two independent log files will be created. The log file contains brief information about user and server actions, 
times of events, their main features, for example, state of the modem or the sound card. These files are intended to be 
sent to the software developers in case of problems but can be used for other purposes as well, for example, to estimate 

3s the server load. 

Due to the fact that the server application always plays a melody with the same quality as one would be able to 
hear through the conventional phone channel [monophonic, 8kHz-sampling rate (signal bandwidth up to 3 .7kHz)], the 
sound quality of the played back clips may be low. This is not related in any way with the quality of sound that would 
be digitally transferred to the client's phone when the melody is selected since listening to the clips from server through 
40 the phone network could not deliver better quality than the phone channel itself. For this reason, sound files 
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compressed in monophonic versus stereo form would be preferred since the rate of delivery would be fester, with no 
loss in playback quality from the phone. At the same time, when the clips are downloaded into the phone in digital form, 
significantly better quality could be delivered upon playback due to the perceptual compression; however, this would 
increase the transfer time. 

The server software could also be implemented to track which clips were sent to which user or subscriber. This 
information could then be tracked and reported to different third parties such as the Copyright Office, or performing or 
artists rights organizations or societies. 
Devices for Acces sing Sound and Image Files 

Electronic devices adapted to receive sound and image data, according to the present invention, may be provided 
with an attachment or built in mechanism for providing consumers with Internet based or Internet free access to a 
library of downloadable sound and/or image flies. Consumers may be. allowed to download free clips of a song, musical 
composition, or other sound recording or movie or other performance onto any of these devices for use as alerts. 

After hearing or viewing a clip, the user, preferably by the push of a button, may transact a purchase of the full 
file associated with the clip, which may be downloaded to the device in its entirety, or delivered to the user's address on 
an independent medium such as records, cassette tapes, CDs, videotapes, and DVDs. Such practice is intended to 
encourage the sales of sound and image files by giving the user the opportunity to quickly make an impulse purchase. 

A device for downloading and listening to music Files, which is similar to a walkman type I-pod™ device, but 
uses the same delivery method as described for the cellular phone comprises a speaker and/or an earphone set for 
listening to music with volume controls (such as Bose or Shure E5 universal earphones), and a server access element 
(which may be approximately the size of a credit card). Such a device may be used as a hand held portable music 
player, as well as a car radio or home system, and may include larger speakers for use as an audio system by businesses 
such as bars, restaurants and clubs. 

In addition to features which allow a user to access the server library, the device may include other features 
common to conventional MP3 players and/or Apple I-Pod™ devices. The server access element includes controls, 
which may be buttons, for accessing, browsing, and downloading files from the server to the device. Speed dial 
technology may be used for accessing the server. For browsing, a multi-task arrows button which allows the user to 
browse, listen to samples, and highlight specific selections may be provided. 

The server access element may include a small LCD monitor (approximately 1" x 1 .75") for text browsing the 
server library. A small microphone hole may also be included for allowing the user to browse the library using voice 
commands. The earphone set or speaker will enable the user listen to downloaded sound files. 

Downloaded files may be stored on a device storage list for accessing at all times, or deleted. Thus, the user may 
access a library containing a large number of sound files, and browse, download, and listen to music, without the 
Internet or the need to plug into a computer. The consumer may be charged a fee for each download, or may be able to 
purchase actual items, for delivery to an address indicated by the user, such as records, cassette tapes and CD's through 
the access element. Free clips which the user can download may induce the consumer into purchasing the entire sound 
file from where the clip originated. 

The device may also include a mechanism for allowing a user to store downloaded files on a medium, such as a 
card, independent of the device. To this end, the device may provide a slot into which a storage card may be inserted, 
such that when the device is full, files may be downloaded onto the card for emptying space on the device. A security 
mechanism may also be included to prevent intellectual property abuse, for example, by preventing users from playing 
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copied files on other devices as described above. Such devices may further include a monitoring feature, which would 
allow performing rights organizations such as ASCAP and BMI to keep track of music publicly played by business such 
as bars, restaurants, and clubs for the purpose of paying out royalties. 

A schematic diagram for a media file monitoring system 1800, according to a preferred embodiment, is shown in 

s FIG. 18, for use with an I-Pod™ type listening device 1802, wherein a consumer may purchase copyright registered 
media files which are downloaded wirelessly to the device 1802. The system 1800 includes an existing wireless 
network 1804 of 1.5G or more, a system monitoring server 1806, and a system content server 1808. The monitoring 
server 1806 monitors and counts every file delivered to the consumer device 1802, for monitoring and counting every 
file delivered to the device 1802. The server 1806 may track each individually titled file which may include 

10 information such as song title and artist name, purchase price, the consumer's name, and other identity information, time 
of delivery, and any other pertinent information. The server 1806 may also protect encrypted copyrighted files .from 
illegal file copying. The content server 1808 stores copyrighted digital media content licensed from multiple 
entertainment companies. Thereafter, monitoring information, including statistics may be transmitted (e.g. through the 
Internet) to a company or organization. The system described may also apply to a viewer device for monitoring image 

15 files. 

A portable laptop type viewer device, for accessing and viewing image and/or sound files, may comprise a 
wireless earphone set and/or speaker for listening to programs with volume controls, and a Personal Digital Assistant 
(PDA) with a monitor which may be approximately the size of a laptop computer. This device allows the user to access 
a server library containing a large number of movies, TV shows, cartoons, and other files, using either text or voice 
20 activation, without the need to plug into a computer or use of the Internet or other computer based wireless 
telecommunication system. 

The files may be categorized and subcategorized by type of file (i.e. movies, TV shows) then by title or name of 
main actors. TV shows may further be classified by providing a description for each episode, similar to a description 
provided in a TV Guide™. Other categories and subcategories of classification may be provided to allow the user to 

25 identify the exact file he wishes to access. 

The device allows the user to browse, download, preview, store and view his selections, (using text, voice, or 
button commands), wherein a fee may be charged by the provider for any or all of these options. The files may be made 
available as clips as well as in their entirety. The viewer device may include a folder containing previously downloaded 
image files that can be accessed at any time and deleted when desired. The library may be organized by categories such 

30 as type of show (i.e. movies, TV sitcoms), names of actors, show titles, sitcom description (e.g. as appearing in TV 
Guide™) etc. 

Method of Advertising nsing Delivery of Sound Clips 

The method of delivering sound and image files, in accordance with the present invention, can further be utilized 
as an advertising tool. To this end, any of the above-described systems carrying the library of sound and/or image files, 
35 which include a website and non-Internet accessible holding unit, may be used to expose the user to sponsored 
advertising messages. For example, a user calling the holding unit may hear advertising while the system is accessing 
the library. 

Other advertising opportunities may be provided by utilizing a phone or other electronic device using alerts 
according to the present invention. For example, the phone may ring with advertising gimmicks such as promotional 
40 messages. Such advertisement gimmicks may be played as default rings when no clip is selected for the ring. 
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Additionally, a phone may be programmed to play, or transmit advertisements spontaneously. Clips containing 
advertising messages such as jingles may also be provided. Advertising messages may be tacked onto a user selected 
clip of a popular song or the like. 

Method of Distributing Music and Audiovisual Works to Consumers 

A method of distributing music and audiovisual works to consumers while accounting to copyright owners of the 
works comprises: (a) Making available on a website various selections of works in various categories for review by 
identifying information and offering a portion of the work for hearing or listening, each work being coded internally 
with identification to a copyright owner or its representative; (b) Allowing consumers to select the viewable or 
listenable portion of the work for data storage online or for downloading to the consumers ■ electronic devices at home 
wherein the downloaded file being encrypted to only play on the consumer's electronic devices first receiving the 
download; (c) Optionally tracking those consumers who received the download of the portion of the work and 
reporting to the copyright owners or their representatives information concerning the download; (d) Allowing a 
consumer to return to the website to purchase and download a complete copy of the copyrighted work previously 
sampled by the consumer; (e) Conducting an online purchasing transaction and charging the consumer for the 
download; (f) Downloading a complete copy of the copyrighted work to the consumer in an encrypted fashion so as 
to be playable only in the consumer's electronic device and not exchangeable with third parties; (g) Tracking those 
consumers who received the download of the copyrighted work and reporting to the copyright owner of their 
representatives information concerning the download; and (h) Paying the copyright owners or their representatives a 
portion of the money received from the consumers for their downloading of the copyrighted work. 

The delivery system, according to the present invention will also integrate with future wireless technology, such 
as 3-G systems, as it becomes available, for offering enhanced capabilities for accessing, delivering, and using sound 
and image files. 

While the present invention has been described with reference to a preferred embodiment or to particular 
embodiments, it will be understood that various changes and additional variations may be made and equivalents may be 
substituted for elements thereof without departing from the scope of the invention or the inventive concept thereof. In 
addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention 
without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to 
particular embodiments disclosed herein for carrying it out, but that the invention includes all embodiments falling 
within the scope of the appended claims. 

INDUSTRIAL APPLICABILITY 

It is an object of the present invention to provide a unique method for using sound and image clips as alert 
sounds for a variety of electronic devices. 

It is a further object of the present invention to provide a method for ringing a cellular telephone using actual 
sound files including sound clips which may comprise real music with human voice, various instrument sounds and 
other sound effects. 

It is a further object of the present invention to provide a software system which may be integrated into existing 
cellular telephone hardware for enabling the cellular telephone to access and utilize sound files including clips, without 
the need for extra hardware. 

rt is a further object of the present invention to provide an accessory attachment for cellular telephones and for 
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landline telephones which will enable the telephone to access and utilize sound files, including clips. 

It is a further object of the present invention to provide a security feature for devices capable of receiving and 
playing multi-media files for preventing consumer unauthorized dissemination of such files. 

It is a further object of the present invention to provide a tracking feature for devices capable of receiving and 
5 playing music files for providing performers and writers rights organizations with an accurate method of determining 
royalty right payments to registered performers and writers. 

It is a further object of the present invention to develop a website for browsing and for delivery of sound and 
image files including clips. 

It is a further object of the present invention to provide a method for selling and buying products associated with 
10 existing copyrighted music, movies, TV shows, and other recorded performances. 

It is a further object of the present invention to provide a delivery method for allowing a user to access, browse 
and download files that is independent of the Internet, and does not require a plug in or hand wired connection. 

These and other objects, advantages, and the industrial utility of the present invention will be apparent from a 
review of the accompanying specification and drawings. 
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_ CLAIMS 

WHAT IS CLAIMED IS: 

1. A method for delivering a sound or visual file from a central server or servers to an electronic 
2 device or devices for a user comprising: 

compressing the file using a compression algorithm; 
4 storing the file in compressed format on a storage medium; and 

transmitting the compressed file from said storage medium to the electronic device, wherein 
6 said compression algorithm comprising normalizing sample amplitudes in sound file, and said 

file playable back for the user with high quality reproduction and selectable by the user from the 
s central server or servers to be retrieved by the electronic device with or without a worldwide network 

connection. 

2. The method of Claim 1 further comprising storing the file on the electronic device. 

3. The method of Claim 1 further comprising playing back the file on the electronic device. 

4. The method of Claim 3 further comprising using the file as an alert message for the electronic 
2 device wherein the file is played to alert the user of the occurrence of a specific event. 

5 . The method of Claim 4 wherein the electronic device is a telephone, and wherein the file is a sound 
2 file played to alert the user of an incoming call. 

6. The method of Claim 5 wherein a segment of the sound file is played to alert the user to the call. 

7. The method of Claim 6 wherein said segment may be played repeatedly to alert the user to the call . 

3. The method of Claim 5 wherein the user associates the sound file with a specific caller's phone 

2 number, wherein the sound file plays when specific caller calls. 

9. The method of Claim 8 wherein a plurality of sound files are delivered and stored on said 
2 telephone, and wherein a user of said telephone may associate one specific sound file of said plurality of sound 

files with one specific caller from a plurality of callers. 

10. The method of Claim 5 further comprising: 

2 said telephone being shared by a plurality of users, wherein a plurality of different sound files 

are delivered to and stored on said telephone, and wherein 
4 a specific sound file of said plurality of sound files is associated with a specific user of said 
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plurality of users, wherein said specific sound file alerts the user of an incoming call where said 
6 specific user is the intended recipient of said incoming call. 

H. The method of Claim 1 wherein the electronic device is a cellular phone, home phone, computer, 

2 clock, watch, pager, door bell, car alarm, palm pilot, or personal calendar. 

12. The method of Claim 1 wherein a plurality of sound files are delivered to and stored on the 
2 electronic device. 

13. The method of Claim 1 wherein the electronic device includes an attachable accessory unit for 
2 receiving the sound files. 

14. The method of Claim 1 wherein said trararmtting step is wireless. 

15. The method of Claim 1 wherein said transmitting step is through the Internet or other computer 
2 based system. 

16. The method of Claim 1 wherein said transmitting step is independent of the Internet or other 
2 computer based system. 

17. The method of Claim 1 wherein said sound file is a personal recording recorded by a user of the 
2 device. 

18. The method of Claim 1 further comprising charging a fee to a user of the device, said fee being 
2 charged based on the number of clips received or their duration. 

19. The method of Claim 1 wherein the file is a segment of a full song, musical composition, or other 
2 sound recording. 

20. The method of Claim 19 further wherein a user of the device may transact the purchase of an item 
2 associated with said sound file upon hearing said sound file. 

21. An electronic device comprising: 

2 a recording method which can store a large number of clips, including sound clips, image clips, 

or a combination thereof, said clips capable of being actual recordings of live performances; 

4 a receiver which receives the clips from a remote server; and 

a player which can selectively play said clips, wherein 

6 said clips can be playable as alerts for the electronic device upon the happening of a 

predetermined circumstances such as an incoming call or other event or circumstance and wherein a 
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8 user of the device may select a clip to play for alerting of the specified event. 

22 ' The device of Claim 21 , wherein the receiver is a wireless device which receives the clip by way of 

2 wireless communication. 

23 ' Tte device of Claim 21, wherein the receiver has a hardwired connection to the internet for 

2 receiving the. clips. 

24 ' The device of Claim 21, wherein the receiver receives the clips by way of a standard hardwired 

2 telephone network without accessing the internet. 

25 * device of Claim 21 wherein said clips are edited snippets from full flies, and wherein the 

2 device is further capable of storing and playing full files including full sound files, full image files, or a 
combination thereof. 

26 - The device of Claim 23 , wherein said full files can be used as said alerts '. 

27 ' The device of Claim21, wherein said clips are taken from songs, musical compositions, cartoons, 

2 movies, television shows, personal recordings, or a combination thereof. 

28. The device of Claim 21, the device being a cellular phone, home phone, computer, clock, watch, 

2 pager, door bell, car alarm, palm pilot, or personal calendar. 

29 - The d evice of Claim 28 wherein the device is a cellular phone using sound alerts. 

30 - The device of Claim 29 wherein said sound alerts are used to indicate an incoming call. 

31 • rbs device of Claim 30 wherein said sound alerts are further used to identify the caller or callee. 

32 - device of Claim 31 wherein said sound alerts for identifying the caller may be selected by 

2 either the caller or callee. 

33 ■ ThQ device of Claim 21 including a collection of sound clips , image clips , or a combination thereof 

2 stored on the device. 

34 - The device of Claim 21 further comprising an accessory unit attachable to the device, said 
2 accessory unit being capable of interfacing with said device and containing said media storage of said clips. 

35 - The device of Claim 34, wherein said accessory unit further contains a player which can play said 
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36. An electronic device comprising: 

2 a medium storage drive to store a large number of sound files, image files, or a combination 

thereof, said files capable of being actual recordings of live performances; and 
4 a player which plays said files, wherein 

said files are used as alerts for the electronic device, and wherein a user of the device may 
e select a file to play for alerting of a specified event. 

37. A method of selling a sound file to a consumer comprising: 

2 providing a telephone or other electronic device capable of playing sound recordings, said 

telephone being operated by the consumer; 
4 providing free sound clips to the consumer for playing on the device; and 

providing the consumer an option to purchase the sound file, the option to purchase the sound 
e file selectable by using the telephone keypad, wherein the consumer may purchase the sound file 

through said means for purchasing the sound files, upon hearing a sound clip taken from the file. 

38. A method of selling a sound file to a consumer comprising: 

2 providing a telephone or electronic product capable of playing sound recordings, said product 

being operated by the consumer; 
4 providing free sound clips to the consumer for playing on the product; 

providing a virtual media storage containing said sound clips, wherein said consumer can 
e browse said virtual media storage and select sound clips to be downloaded to said product; 

providing an option to purchase said sound file, using the product, or said virtual media storage, 
8 or a combination thereof. 

39. The meHiod of Claim 38, wherein said virtual media storage is accessible through the product, and 
2 wherein purchasing said sound file comprises the step of indicating an intent to purchase said file by pressing 

at least one key. 

40. The method of Claim 38 , wherein said virtual media storage is accessible through the Internet, and 
2 wherein purchasing said sound file comprises the step of computer interaction on a keyboard indicating intent 

to purchase said file by way of the internet. 

41. The method of Claim 38, wherein said file is downloadable to the user's product in a file format 
2 which cannot be copied by third parties. 

42. A method of selling a sound or image file, or a combination thereof comprising the stepsof : 
2 (a) Providing an electronic device; 
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(b) Reviewing categories of music or image files from a selection by way of a 
telecommunications network; 

(c) Listening to truncated samples of the music or image files from the selection; 

(d) Selectively storing or downloading selected truncated samples of the music or image files 
free of charge; 

(e) Receiving the truncated samples of the music or image files on a portable electronic device 
for subsequent personal use such as using a ring tone, alarm or other alert; and 

(f) Subsequently purchasing the complete music or image file from those previously selected 
truncated music or image files by way of a telecommunications network. 

43 ♦ The method of Claim 42, wherein the file is downloadable to the electronic device, or contained on 

a physical storage medium. 

44 • Tn e method of Claim 43, said medium being a cassette tape, video tape, record, CD, or DVD. 

45. A cellular phone comprising: 

a receiving element for receiving preselected sound clips, said clips having high quality 
reproduction of musical compositions; and 

a player to play said clips with the cellular phone; 

wherein said clips are playable as ring alerts for when the phone receives a telephone call. 

46 * The phone of Claim 45 , wherein said receiving element comprises software integrated with existing 

hardware of the cellular phone. 

47 • The phone of Claim 45, wherein said player includes a software system integrated into existing 

hardware of the cellular phone. 

48. The phone of claim 45, further comprising a media storage to store a plurality of said clips with said cellular 
phone. 



A method of distributing music and audiovisual works to consumers while accounting to copyright 
owners of the works comprising the steps of : 

(a) Making available on a website various selections of works in various categories for review 
by identifying information and offering a portion of the work for hearing or listening, each work being 
coded internally with identification to a copyright owner or its representative; 

(b) Allowing consumers to select the viewable or listenable portion of the work for data storage 
online or for downloading to the consumers' electronic devices at home wherein the downloaded file 
being encrypted to only play on the consumer's electronic devices first receiving the download; 

(c) Optionally tracking those consumers who received the download of the portion of the work 
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and reporting to the copyright owners or their representatives information concerning the download; 

(d) Allowing a consumer to return to the website to purchase and download a complete copy of 
the copyrighted work previously sampled by the consumer; 

(e) Conducting an online purchasing transaction and charging the consumer for the download; 

(f) Downloading a complete copy of the copyrighted work to the consumer in an encrypted 
fashion so as to be playable only in the consumer's electronic device and not exchangeable with third 
parties; 

(g) Tracking those consumers who received the download of the copyrighted work and 
reporting to the copyright owner of their representatives information concerning the download; and 

(h) Paying the copyright owners or their representatives a portion of the money received from 
the consumers for their downloading of the copyrighted work. 
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AMENDED CLAIMS 

[received by the International Bureau on 25 November 2002 (25 J 1.02); 
original claims 37, 38 and 42 amended: new claims 50-142 added: 
clips remaining claims unchanged ( 1 2 pages)] 

36. An electronic device comprising: 

a medium storage drive to store a large number of sound files, image files, or a combination 
thereof, said files capable of being actual recordings of live performances; and 
a player which plays said files, wherein 

said files are used as alerts for the electronic device, and wherein a user of the device may 
select a file to play for alerting of a specified event. 

37. A method of selling a sound file to a consumer comprising: 

providing a telephone or other electronic device capable of playing sound recordings, said 
telephone or other electronic device being operated by the consumer; 

providing free sound clips to the consumer for playing on the device; and 
providing the consumer an option to purchase the sound file, the option to purchase the sound 
file selectable by using the telephone keypad or said device, wherein the consumer may purchase the 
sound file through means for purchasing the sound file, upon hearing a sound clip taken from the file, 
and wherein said sound clip can be played by said telephone or other electronic device with high quality 
reproduction. 

38. A method of selling a sound file to a consumer comprising: 

providing a telephone or electronic product capable of playing sound recordings, said product 
being operated by the consumer; 

providing free sound clips to the consumer for playing on the product; 

providing a virtual media storage containing said sound clips, wherein said consumer can 
browse said virtual media storage and select sound clips to be downloaded to said product; and 

providing an option to purchase said sound file, using the product, or said virtual media 
storage, or a combination thereof, wherein said sound clip can be played by said telephone or electronic 
product with high quality reproduction. 

39 - The method of Claim 38, wherein said virtual media storage is accessible through the product, and 

wherein purchasing said sound file comprises the step of indicating an intent to purchase said file by pressing 
at least one key. 



The method of Claim 38, wherein said virtual media storage is accessible through the Internet, and 
wherein purchasing said sound file comprises the step of computer interaction on a keyboard indicating intent 
to purchase said file by way of the internet. 

The method of Claim 38, wherein said file is downloadable to the user's product in a file format 
which cannot be copied by third parties. 
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42 - A method of selling a sound or image file, or a combination thereof comprising the steps of: 

(a) Providing an electronic device; 

(b) Reviewing categories of music or image files from a selection by way of a 
telecommunications network; 

(c) Listening to truncated samples of the music or image files from the selection; 

(d) Selectively storing or downloading selected truncated samples of the music or image files 
free of charge; 

(e) Receiving the truncated samples of the music or image files on a portable electronic device 
for subsequent personal use such as using a ring tone, alarm or other alert, wherein said truncated 
samples can be played on said portable electronic device with high quality reproduction; and 

(0 Subsequently purchasing the complete music or image file from those previously selected 
truncated music or image files by way of a telecommunications network. 

43 - The method of Claim 42, wherein the file is downloadable to the electronic device, or contained on 
a physical storage medium. 

44 • The method of Claim 43, said medium being a cassette tape, video tape, record, CD, or DVD. 

45. A cellular phone comprising: 

a receiving element for receiving preselected sound clips, said clips having high quality 
reproduction of musical compositions; and 

a player to play said clips with the cellular phone; 

wherein said clips are playable as ring alerts for when the phone receives a telephone call. 

46 * Tne P h ° ne of Claim 45, wherein said receiving element comprises software integrated with existing 

hardware of the cellular phone. 

47 * Tne phone of Claim 45, wherein said player includes a software system integrated into existing 

hardware of the cellular phone. 

48 - The phone of claim 45, further comprising a media storage to store a plurality of said clips with 
said cellular phone. 

49 - A method of distributing music and audiovisual works to consumers while accounting to copyright 
owners of the works comprising the steps of : 

(a) Making available on a website various selections of works in various categories for review 
by identifying information and offering a portion of the work for hearing or listening, each work being 
coded internally with identification to a copyright owner or its representative; 

(b) Allowing consumers to select the viewable or listenable portion of the work for data storage 
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online or for downloading to the consumers* electronic devices at home wherein the downloaded file 
being encrypted to only play on the consumer's electronic devices first receiving the download; 

(c) Optionally tracking those consumers who received the download of the portion of die work 
and reporting to the copyright owners or their representatives information concerning the download; 

(d) Allowing a consumer to return to the website to purchase and download a complete copy of 
the copyrighted work previously sampled by the consumer; 

(e) Conducting an online purchasing transaction and charging the consumer for the download; 
(0 Downloading a complete copy of the copyrighted work to the consumer in an encrypted 

fashion so as to be playable only in the consumer's electronic device and not exchangeable with third 
parties; 

(g) Tracking those consumers who received the download of the copyrighted work and 
reporting to the copyright owner of their representatives information concerning the download; and 

(h) Paying the copyright owners or their representatives a portion of the money received from 
the consumers for their downloading of the copyrighted work. 

50. A method for the playback of sound files comprising: 

providing an electronic device having a speaker; and 

utilizing a delivery method algorithm, a parametric optimization and compression algorithm, an 
error correction algorithm, or a combination thereof, for playing the sound files on said device, 
wherein 

said delivery method algorithm uses an orthogonal frequency-division multiplex 
modulation scheme to transmit data relating to the sound file for playback on said electronic device, 

said parametric optimization and compression algorithm uses specific parameters for 
compressing said data, and 

said error correction algorithm is used for said data transmitting over the voice channel of 
. a telephony network, wherein said error correction algorithm comprises splitting said data into 
smaller data blocks, and using an error detection code to detect possible error in each of said 
smaller data blocks. 

11 * The method of Claim 50 further comprising downloading firmwear to a memory element 

associated with the device, said firmwear comprising said delivery method algorithm, said parametric 
optimization and compression algorithm, said error correction algorithm, or a combination thereof. 

;2 - Tne method of Claim 51 wherein said device is a digital or analogue cellular telephone having a 

built in memory element to which said firmwear is downloaded. 

i3 - Tne method of Claim 51 wherein said device includes an accessory unit which directly attached 

to said device wherein said memory element, speaker, or both are located in said accessory unit. 
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54 « The method of Claim 50, further comprising storing the sound files on the device &r 61i a' unit"* 

associated with the device. 

55 • The method of Claim 50 wherein said device is capable of receiving said files in a compressed 

format and playing back said files. 
56 - The method of Claim 50 wherein said sound files are stored on a memory element of said 

device in a compressed format, 

57 • The method of Claim 50, further comprising providing a holding system containing said sound files, 

said holding system being accessible by the device for downloading user selected sound files to the device or 
to a unit associated with the device, said holding system being either Internet based or non-Internet based. 

58 • T^ method of Claim 57 wherein said holding system is non-Internet based and accessible via a 

telephone dial in connection. 

59 - Th e method of Claim 57 wherein a user of said device can interact with the holding system to 
select a sound file to be downloaded to said device via voice commands, phone keypad commands or other 
control button commands, or a combination thereof. 

60 - Tne method of Claim 50, wherein said sound files comprise unedited sound recordings, 
truncated samples of unedited sound recordings, or both. 

61 • The method of Claim 50 further comprising providing means for preventing sound files from 

being copied or transferred among multiple electronic devices. 

The method of Claim 61 , said means for preventing sound files from being copied or transferred 
comprising encoding said device with scrambling/ unscrambling wave capabilities, 

said scrambling/ unscrambling wave capabilities being unique to said device, such that when a sound 
file is delivered to said device, a unique scrambling wave is encoded in said file, and when said file is played 
back, a corresponding unique unscrambling wave is sent, such that the file can be played back with clarity. 

63 • The method of Claim 50, further comprising means for preventing sound files from being 

transferred among multiple electronic devices which are owned by different users. 

64 • The method of Claim 50, further comprising providing means for tracking information 

regarding the use and delivery of sound files by a user of said device. 

65 • The method of Claim 64, further comprising recording information, related to the use and 

delivery of sound files by a multiple number of users. 
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66 * The method of Claim 64, said information including the user's identity and idenfitybf a sound 

file played by said user. 

67 • The method of Claim 64, further comprising using said information to determine royalty payments 

to parties having copyright ownership in said files for a public performance of the files. 

68 - The method of Claim 67 wherein said parties are registered with a performing rights 
organization who makes said royalty payments to said parties. 

69 - The method of Claim 50 wherein said sound files are used as alerts by the device to indicate the 
occurrence of a specific event. 

70 - The method of Claim 69 wherein a user of the device can program the device to play a specific 
sound file or files to indicate said event. 

71 • The method of Claim 70, wherein said user can select the order and manner in which said 

specific sound file or files are played to indicate said event. 

72 - Tne method of Claim 69 wherein a user of the device can store a multiple number of sound 

files on said device in a specific order selected by said user. 

73 ■ The method of Claim 69 wherein said device is a telephone, and wherein said sound files are 

used to alert of an incoming telephone call. 

74 • The method of Claim 73 wherein a specific sound file or files are associated with a specific 

caller, such that said specific sound file or files are sounded to indicate to a user of said device that said 
specific caller is calling. 

75 • The method of Claim 73 wherein a caller of said incoming call may select a specific sound file 

or files to alert of said call. 

76 - Tne method of Claim 73 wherein said sound files used to alert of an incoming telephone call may be 

personal recordings made by a user of said telephone or by a caller of said incoming call. 

77 • The method of Claim 73 wherein said telephone is shared by multiple users, and wherein a specific 

sound file or files are associated with a specific one of said multiple users, such that said specific sound file or 
files are played to indicate of an incoming call for said specific one of said multiple users. 

78 ■ The method of Claim 50 wherein said sound files may be played on said device at an adjustable 
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79. The method of Claim 50, further comprising allowing a user to download a sound file to the 
device upon playing a truncated version of said sound file on said device. 

80. The method of Claim 50, further comprising charging the user a fee for downloading said 
sound file to the device. 

81. The method of Claim 50 wherein said sound file is an advertising message played on the device. 

82. The method of Claim 81 wherein said advertising message is played. 

83. The method of Claim 50 wherein said device further comprises a display screen for displaying 
an image file. 

84. The method of Claim 50 wherein said sound files may be transferred among multiple electronic 
devices. 

85. The method of claim 84 wherein said multiple electronic devices are owned by a single user. 

86. The method of Claim 50 wherein said device is a telephone having a wireless or land line service 
provider. 

87. The method of Claim 50 wherein a large number of files are capable of being stored on a 
memory element of said device. 



88. The method of Claim 50 further comprising providing a server for enabling a user to download 
selected sound files to the device through user's requests to said server. 

89. The method of Claim 88 further comprising storing a large number of sound files on a database 
associated with said server, said server utilizing said compression algorithm for converting common sound 
files into compressed files for downloading said compressed files onto said memory element associated 
with said device. 

90. The method of Claim 88 wherein a single server is capable of interacting with a variety of 
different networks, telephones, or other electronic device, or a combination thereof. 

91. The method of Claim 50 wherein said delivery method algorithm is used for transferring data 
through a phone line and receiver, based on a voice mode connection and DTMF signal interpretation. 
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92. The method of Claim 50 wherein said delivery method algorithm for transmitting said data 

comprises the steps of scrambling said data, mapping said data, converting frequency symbols of said 
data into time samples, adding a cyclic prefix to said data, converting said data from digital to analogue 
form, converting said data from analogue to digital form, or a combination thereof. 

93 - The method of Claim 92 wherein said delivery method algorithm further comprises the steps of 

unscrambling said data after scrambling said data, decoding said frequency symbols to bits, converting 
said time samples to frequency symbols, synchronizing said symbols, or a combination thereof. 

94 ■ The method of Claim 50 wherein said delivery mediod algorithm allows simultaneous use of voice 

communication and data transmission features during a single user connection session to a server holding 
collection of said sound files without requiring a user to switch the mode of connection. 

95 • Th e method of Claim 94, wherein a user can access said server via a telephone dial in 

connection without the need to use a wireless internet service providers. 

96 - The method of Claim 50, wherein said error" correction algorithm is used and wherein said 

error detection code is a cyclic redundancy code. 

97 • The method of Claim 50, wherein said error correction algorithm is used and wherein said 

transmission of said data through said network is unidirectional or bi-directional. 

98- The method of Claim 50, wherein said parametric optimization and compression algorithm 

comprises the steps of: 

normalization of sample amplitudes of said data; 

conversion of sample rate of said data to selected sampling frequency; 

pre-emphasis filtering; and 

compression of said data with selected parameters. 

The method of Claim 98 wherein said pre-emphasis filtering uses a finite impulse response filter. 

The method of Claim 50 wherein said sound file can be received by the device in either mono or 
stereo form, and wherein said sound file can be played back by the device in either mono or stereo form. 

101 . A method for data transmission over a voice channel of a telephone using an orthogonal 

frequency-division multiplex modulation scheme and comprising scrambling said data, mapping said data, 
converting frequency, symbols of said data into time samples, adding a cyclic prefix to said data, 
converting said data from digital to analogue form, or a combination thereof. 
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102 - The method of Claim 101, further comprising using an error correction algorithm, said error 
correction algorithm comprising splitting said data into smaller data blocks, and using an error detection 
code to detect possible error in each of said smaller data blocks. 

103 - The method of Claim 101 , further comprising using a parametric optimization and compression 
5 algorithm for compressing said data, said parametric optimization and compression algorithm using 

specific parameters for compressing said data. 

104. A method of preventing or limiting distribution of a copyrighted sound or image file by purchasers 

of the file comprising: 

providing a device capable of receiving and playing back the file, said device being capable of 
10 encoding the file with a scrambling wave, said scrambling wave being unique to said device; 

encoding the file with said scrambling wave once the file is received by said device; and 
playing the file on said device while sending an unscrambling wave to counter said 
scrambling wave, such that the file can be played with clarity. 

105 « The method of Claim 104 wherein said device is a telephone, and wherein said scrambling and 

15 unscrambling waves are functions of the telephone number. 

106 - The method of Claim 104 wherein the file is transmitted to a user of said device for a fee. 

107 • A method of collecting information regarding the public performance of copyrighted media content 

comprising: 

providing a device capable of receiving and playing back a media file containing said 
20 copyrighted media content; 

providing a tracking feature on said device for tracking information relating to the number of 
times a particular media file is played; and 

providing a central unit capable of transacting the sale of sound files which can only be played 
on said device, 

2 * wherein said information is related to the ownership and performance of a number of 

media files is complied by said tracking feature. 

A method of determining royalty distribution to be paid out to owners of copyrighted content for the 
public performance of said copyrighted content comprising: 

providing files containing said copyrighted content; 
30 providing a device for playing said files, said device including a tracking feature for recording 

information regarding the identity of the copyrighted content being played; 

collecting said information and providing said information to a performing rights organization; 
and 
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using said information by said organization to determine distribution of royalty payments to said 
parties. 

I09 - The method of Claim 108 wherein said information includes the identity of a copyrighted work 

being played, and the number of times the work is played over a specified period. 

1 10 * The method of Claim 108, further comprising preventing distribution of said files among 

different purchasers of said files. 

J J J • A website for selling media items comprising: 

a list of media samples taken from said media items, said samples being downloadable to a 
consumer owned electronic device to be played by the device as alerts, and comprising sound, 
image, or both; and 

means for enabling a consumer to purchase the media items associated with said samples, said 
media items including files comprising sound, images, or a combination thereof, which are 
downloadable to said device, or records, cassette tapes. CDs. DVDs, or video tapes containing said 
files, or a combination thereof, 

said sound being reproducible by said device at a quality comparable to the quality of 
sound as heard on an original sound track from which the sample was taken. 

1 12 ■ Tne website of Claim 111, further comprising a storage unit for allowing said consumer to 

store a number of said samples which are selected for storage by said consumer. 

1 13 - Tne website of Claim 111, wherein said consumer can further upload consumer recorded files, 

and store said files on said storage unit for later downloading to said electronic device. 

1 14 * Tne website of Claim 111, wherein said electronic device is a telephone, and said samples are 

used as ring sounds. 

1 15 - The website of Claim 1 1 1 wherein said electronic device is a computer, and wherein said 
samples are used as computer alerts. 

1 16 - The website of Claim 1 1 1 wherein said consumer may drag his mouse over various listed items 
which flash and play one at a time. 

117 - Tne website of Claim 111 wherein said items are listed in specific categories, such that when 
said user places his mouse over a specific category, an item from that category automatically plays, and 
information relating to the identity of said item is displayed. 
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1 18* The website of Claim 1 1 1 wherein said consumer is charged a fee for accessing the website, 

downloading said samples, or both. 

1 19 - The website of Claim 1 1 1 wherein said samples are available for consumer downloading for free. 

120. The website of Claim 111 wherein said consumer can indicate an intent to purchase an item 
directly upon the item being played on said electronic device or on the website. 

121. The website of Claim 111. wherein the website is associated with a consumer contest, wherein 
said consumer is required to interact with the website in order to participate in said contest. 

122. A method of doing business based on providing sound or image clips to consumers, comprising: 

providing the consumer with an electronic device capable of using sound clips, image clips, or a 
combination thereof as alerts for said device, said sound clips capable of being stored on the device 
and played on the device with a quality comparable to the quality of sound as heard on an original 
sound track from which the clip was taken; and 

providing a holding unit having a large number of said sound clips, image clips, or a 
combination thereof, said holding unit being accessible by said consumer for interacting with 
said holding unit and optionally downloading said clips onto said electronic device. 

123. The method of Claim 122 further comprising charging the consumer a fee for accessing said 
holding unit, downloading clips from said holding unit, or both. 

124. The method of Claim 122 wherein said holding unit is accessible via a phone connection. 

125. The method of Claim 122 wherein said consumer is may optionally browse through said clips, 
and select clips to download. 

126. The method of Claim 122 wherein said fee is based on the amount of time said consumer 
interacts with said holding unit. 

127. The method of Claim 122 wherein said consumer is charged a fixed fee for accessing the holding 
unit. 

128. The method of Claim 122 wherein said fee is based on the number of clips downloaded. 

129. The method of Claim 122 wherein said device is provided with pre-embedded clips such that the 
consumer can use said clips as alerts without having to download clips form said holding unit. 

130. The method of Claim 122 wherein said device is a cellular or landline telephone. 
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131. The method of Claim 122 wherein said holding unit is a website. 

132. The method of Claim 122 wherein said holding unit is accessible using the device itself. 

133. The method of Claim 122 wherein said holding unit is non-Internet based. 

134 - The method of Claim 122 further comprising selling items associated with said clips, said items 

including files comprising sound, images, or a combination thereof, which are downloadable to said device, or 
records, cassette tapes, CDs, DVDs, or video tapes containing said files, or a combination thereof. 

135. The method of Claim 134 wherein said clips are offered to the consumer for free. 

136. The method of Claim 134 wherein said clips are offered to the consumer via a telephone service 
provider. 

137. The method of Claim 136 wherein said service provider charges the consumer a fee for 
participating in a service offering said clips. 

138. A chip which can be incorporated into an electronic device, the chip comprising: 

means for enabling the device to play back sound files using a delivery method algorithm, a 
parametric optimization and compression algorithm, or both, wherein 

said delivery method algorithm uses an orthogonal frequency-division multiplex 
modulation scheme to transmit data relating to the sound file for playback on said electronic device, 
and wherein 

said parametric optimization and compression algorithm uses specific parameters for 
compressing said data. 

139. An electronic device comprising: 

means for playing sound files on the device as alert means, said means for playing said sound 
files comprising playing said files on the device such that the quality of the sound played is 
comparable to the quality of sound from an original sound track for playing said sound, and 

parametric optimization and compression means for storing said sound files on the device in a 
compressed format. 

140. The device of Claim 139, the device being a cellular phone, home phone, computer, clock, 
watch, pager, door bell, car alarm, palm pilot, or personal calendar. 

141. The device of Claim 139, wherein the device comprises a plurality of said sound files stored on the 
device, said files being stored on the device by a non-consumer entity prior to consumer purchase of the 
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device. 



142. The device of Claim 139, wherein the device comprises means for enabling a consumer to 

select and store specific sound files on the device. 
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